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researchers in the emerging field of neuromodulation. Findings already show improved outcomes 
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will have their essay published online. For additional inquiries, email SciencePINSPrize@aaas.org. 
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with the Zhejiang Lab Brain Intelligence Team 


J. — 
Tianzi Jiang (back row, center) 


A bottom-up approach to brain-inspired Al 


Scientists at Zhejiang Lab are learning more about neurobiology, and turning that 
knowledge into more powerful artificial intelligence. 


In the 1940s, scientists started developing simple electrical circuits intended 

to mimic the interactions of neurons in the brain. Those early neural networks 
evolved into computer models of artificial intelligence (Al) in the 1950s. Since then, 
neuroscience and Al have progressed along parallel paths. 

Tianzi Jiang is a senior research specialist at China-based Zhejiang Lab's 
Research Center for Human-Machine Augmented Intelligence; he and his 
colleagues are developing new brain-inspired Al systems driven by data and 
knowledge. By working on computability, the generalization of Al, and other 


areas, these scientists aim to improve Al cognition, decision making, and learning. 


For example, Zhejiang Lab has created the Large-Scale Humanoid Intelligence 
Knowledge Base, which can be applied to Al-based knowledge mining, 
memorization, and more. 

“Recent scientific efforts have offered some insights by examining how the 
objectively measured patterns of brain organization and function can inspire the 
design of the artificial neural network,” says Jiang. 

To help scientists develop and test new methods and technologies, Zhejiang Lab 
recently built its Dubhe Al open-source platform. This system arises from a high- 
performance, distributed-computing architecture that integrates one-stop Al model 
development, a high-performance deep-learning framework, and a knowledge- 
refining platform. The features of this platform include a complete parallel mode, 
automatic arrangement and execution of computing steps, and high operational 
efficiency and stability. Scientists can apply this platform to research on Al-based 
learning and decision making, which drives a new-generation of Al technology 
based on human-like knowledge and autonomous learning. 

“Instead of treating the Al system as a black box,” says Jiang, “scientists at the 
Zhejiang Lab keep bottom-up options open to establish a biologically plausible Al 
system inspired by the multiscale principles of brain organization and function to 
gain access to the realm of biological intelligence.” 


Avision of new brain-inspired Al models 

By studying the organization and activity patterns in the brain, scientists can 
incorporate that knowledge in new artificial neural networks. “A promising example 
comes from the correspondence between the activities of a convolutional neural 
network and the human ventral visual pathway during the processing of the same 
images,” Jiang explains. "When the input images are specified to contain face stimuli 
only, the activity of the face-selective area in the human brain can be well predicted 
by the activity of face-selective units in the model.” 

Although better understanding of the brain promises even better Al-based 
applications, Jiang points out that “the brain is highly heterogeneous at different 
spatiotemporal scales, and exploring it requires an efficient approach—just like 
a nautical chart [is necessary] for a navigator—to integrate different pieces of 
knowledge to form a more complete landscape.” 

To provide such a chart, Jiang's team created the Brainnetome Atlas, which is 
described as “an in vivo map, with more fine-grained functional brain subregions 
and detailed anatomical and functional connection patterns for each area.” From 
this, Jiang and his colleagues developed a Brainnetome-inspired Al system, in which 
he says a digital-twin brain can “achieve a biologically plausible neural network to 
understand and process both data and knowledge in a humanoid way.” 

Zhejiang Lab scientists are far ahead of their predecssors who built the neural 
networks of the 1940s—yet they are only getting started in advancing and applying 
brain-mimicking Al. As Jiang says, “By leveraging the knowledge accumulated in 
brain studies, more brain processes can be included to inspire the design of artificial 
neural networks, which will hopefully push the boundary of our understanding of 
human intelligence.” 
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1028 News at a glance 


1030 Alzheimer’s drug stirs 
excitement—and concerns 
Antibody slows cognitive 
decline, but deaths, brain bleeds, 
and swelling mar results 

By J. Couzin-Frankel and C. Piller 


1032 NASA radar altimetry mission 

to study hidden ocean swirls 

Enhanced resolution of SWOT satellite will 
highlight how small eddies soak up heat and 
carbon By P. Voosen 


1033 Image problems besiege 

Stanford president 

Marc Tessier-Lavigne's early papers are 
subject of school and journal investigations 
By J. Kaiser 


1034 In China, ‘zero COVID’ has 
become a Catch-22 

Population chafes at control measures, 
but lifting them now would carry huge risks 
By D. Normile 
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1035 New U.S. law aims to light up 
medical research on cannabis 

Biden signs bill streamlining pot studies 
and production By M. Wadman 


1036 Hero city 

Kharkiv was Ukraine's science hotbed 
until Russia attacked. The crippled city 
refuses to give up By R. Stone 


: 1039 The volunteer By R. Stone 


: 1040 The caretaker By R. Stone 
PODCAST 


1042 Out of the ashes 

Slavutych was built after the nuclear 
meltdown at Chornobyl. The town, scarred 
by war, is planning a research revival 

By R. Stone 

PODCAST 


INSIGHTS 


1044 New books for young scientists 


1051 The therapeutic potential 

of psychedelics 

The development of psychedelics 

as medicines faces several challenges 
By E.A. D. Schindler and D. C. D'Souza 


Russian missiles and artillery 
pummeled Kharkiv, Ukraine’s major 
science center, earlier this year. 


1053 Herbaria reveal cost of the 

green revolution 

Rapid weed evolution is exposed by genome 
sequencing of natural history collections 

By K. Waselkov and K. M. Olsen 

RESEARCH ARTICLE p.1079 


1054 Getting bifunctional molecules 
into cells 

A class of transmembrane proteins helps 
shuttle large drugs across the cell membrane 
By R. S. Lokey and C. Pye 

REPORT p. 1097, 


science.org SCIENCE 


CREDITS: (PHOTO) KYODO NEWS VIA AP IMAGES; (ILLUSTRATION) CHIARA VERCESI 
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1056 AlphaCode and “data-driven” 
programming 

Is ignoring everything that is known about 
code the best way to write programs? 

By J. Z. Kolter 

RESEARCH ARTICLE p. 1092 


POLICY FORUM 


1057 Fairness considerations in global 
mitigation investments 

Current mitigation finance flows are 
inadequate and unfair By S. Pachauri et al. 


LETTERS 


1060 Data-based tools can prevent 
crowd crushes 
By M. Haghani and R. Lovreglio 


1060 Co-management of Chile’s escaped 
farmed salmon 
By G. Figueroa-Mufioz et al. 


1061 Long-term solutions for China’s 
heat and drought 
By X. Guo et al. 


RESEARCH 


IN BRIEF 
1062 From Science and other journals 


REVIEW 


1065 Batteries 

Designing better electrolytes Y. S. Meng et al. 
REVIEW SUMMARY; FOR FULL TEXT: 
DOI.ORG/10.1126/SCIENCE.ABQ3750 


RESEARCH ARTICLES 


1066 Systems biology 

Principles of gene regulation quantitatively 
connect DNA to RNA and proteins in bacteria 
R. Balakrishnan et al. 


RESEARCH ARTICLE SUMMARY; FOR FULL TEXT: 
DO!LORG/10.1126/SCIENCE.ABK2066 


1067 Computer science 

Human-level play in the game of Diplomacy 
by combining language models with strategic 
reasoning Meta Fundamental Al Research 
Diplomacy Team (FAIR) 


1074 Geology 

Anew biologic paleoaltimetry indicating Late 
Miocene rapid uplift of northern Tibet Plateau 
Y. Miao et al. 


Common waterhemp (Amaranthus tuberculatus) is native to North America, where it has become a prominent 
agricultural weed in the last half century as it evolved in response to intensive farming practices. 


1079 Evolutionary ecology 

Rapid weed adaptation and range expansion 
in response to agriculture over the past two 
centuries J. M. Kreiner et al. 

PERSPECTIVE p. 1053 


1085 Organic chemistry 

Catalytic asymmetric C-H insertion reactions 
of vinyl carbocations 

S. K. Nistanaki et al. 


1092 Computer science 
Competition-level code generation 
with AlphaCode Y. Liet al. 
PERSPECTIVE p. 1056 


REPORTS 


1097 Chemical genetics 

IFITM proteins assist cellular uptake of 
diverse linked chemotypes K. Lou et al. 
PERSPECTIVE p. 1054 


1105 Martian geology 

Aqueous alteration processes in Jezero crater, 
Mars—implications for organic geochemistry 
E. L. Scheller et al. 


1111 Antibiotic resistance 

Tuberculosis treatment failure associated 
with evolution of antibiotic resilience 

Q. Liu et al. 


1118 Crystallization 
Liquid metal synthesis solvents for metallic 
crystals S.A. Idrus-Saidi et al. 


1125 Piezoelectrics 

Giant electric field—induced 
strain in lead-free piezoceramics 
G. Huangfu et al. 


DEPARTMENTS 


1027 Editorial 
Thank you, Tony! 
By H. H. Thorp 
PODCAST 


1134 Working Life 
Working through grief 
By M. Schultz 


ON THE COVER 


This image shows blocks with snippets of 
programs of varying viability (height) created 

by humans (purple) or by AlphaCode (white). 
AlphaCode is an artificial intelligence system that 
writes computer programs at a competitive level 
and solves new problems 
by generating and filter- 
ing millions of diverse 
candidates. In the future, 
code generation systems 
like AlphaCode may assist 
and educate programmers. 
See pages 1056 and 1092. 
Credit: DeepMind 
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EDITORIAL 


Thank you, Tony! 


nthony Fauci—“Tony” to friends, colleagues, and 
many journalists—has never backed down from 
controversy. During the worst of the AIDS crisis, 
when HIV infection was a death sentence, he was 
targeted by protestors frustrated by the US gov- 
ernment’s sluggish response. Instead of ignoring 
their complaints, Fauci listened, acted, and even- 
tually came to be seen as an ally to activists, all the while 
battling charlatans who tried to insist that HIV didn’t 
cause AIDS. When the COVID-19 pandemic came along, 
the long-time director of the National Institute of Aller- 
gy and Infectious Diseases was a natural choice to help 
lead the country’s response, and he soon became one of 
the most recognizable physician-scientists in the world. 
Fauci recently announced that he 
would end his time in the federal 
government in December 2022. 
He sat down with me to talk about 
the challenges facing science and 
his plans for the future. The full 
transcript of our interview, which 
is filled with valuable insights 
into science policy and commu- 
nication, is posted on the Editor’s 
Blog, but I wanted to share a few 
highlights here. 

We talked a lot about misin- 
formation and the forces sowing 
doubts about COVID-19 vaccines 
and interventions such as masks 
and lockdowns. When I asked 
him how this round of science de- 
nial compared with what he saw during the HIV/AIDS 
crisis, he agreed that the two shared common themes, 
but argued that our current crisis is fundamentally dif- 
ferent, due largely to powerful platforms offered to anti- 
science actors by social media and other channels. “I 
always say, “The best way to counter misinformation is to 
flood the system with correct information,” he told me. 
“That’s still true, but you're fighting against a big surge 
of misinformation.” 

We agreed that one of the biggest challenges with 
which the scientific community must grapple is misin- 
formation that comes from within. “I ran into that like a 
stone wall at a congressional hearing,” he said, describ- 
ing how his skepticism of an argument put forward by 
researchers who questioned the necessity of COVID-19 
lockdowns was met with admonishment. “Youre arro- 


“As a scientist, 
you have an 
obligation...to speak 


the truth and 
not be evasive...” 


Anthony Fauci, 
National Institute of Allergy 
and Infectious Diseases 


gant because you're disrespecting other scientists,’ he 
was told. Fauci’s response: “You don’t want to be being 
disparaging of anyone, but you’ve got to come back and 
push back against things that are not true.” 

We also talked about how we need to do a better job 
of conveying that science is a process and not a col- 
lection of facts set in stone. I was trying to stay away 
from the pushback Fauci received each time pandemic 
guidance had to be revised, but it hung in the air, un- 
spoken. “For those who don’t understand the iterative 
nature of science, you are flip-flopping, and you’ve un- 
dermined the entire scientific process,” he observed. 
“What we can do better is articulate to people that 
when we tell them something based on the evidence 
we have now, that we are dealing 
with an evolving situation.” 

Unlike many of his govern- 
ment peers, Fauci has always 
been forthcoming with the me- 
dia. “As a scientist, you have an 
obligation...to speak the truth 
and not be evasive, because 
that’s antithetical to the scien- 
tific principles,” he told me. “And 
when I talk to my scientific col- 
leagues in government, often 
they’re afraid to say something, 
and I’m telling them behind the 
scenes, ‘But what’s the problem? 
Youre not going to lose your job 
for telling the truth’ And I al- 
ways found that that has served 
me very, very well.” In my opinion, his preference for 
straight talk over carefully crafted statements has also 
served the public well. 

When I asked what was next, Fauci was scrupulous 
about the fact that it couldn’t be worked out until he left 
government, but he gave me the broad strokes. “I want 
to be able to write. I want to be able to lecture. I want to 
be able to advise when asked for advice,” he said. “And 
[I want to] use the benefit of my experience to do two 
things: to help people and organizations, including the 
federal government, or universities, who could benefit 
from my experience, but also, as important, to inspire 
young people to either go into science, or if they are in 
science, to pursue it in a manner that benefits the pub- 
lic health and to do it with public service.” 

-H. Holden Thorp 


H. Holden Thorp 
Editor-in-Chief, 
Science journals. 
hthorp@aaas.org: 
@hholdenthorp 
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CLIMATE SCIENCE 


Volcano and NASA deliver blows to climate monitoring 


An erupting fissure 
high on Mauna Loa’s 
Northeast Rift Zone. 


fforts to monitor global greenhouse gas emissions suf- 
fered two setbacks last week—one by chance, one by 
choice. In Hawaii, the first eruption of the Mauna Loa 
volcano since 1984 has cut off road access and power 
to a famed summit lab that has monitored atmospheric 
carbon dioxide (CO,) levels since 1958. Although lava 
flows have so far spared the lab, which is run by the National 
Oceanic and Atmospheric Administration (NOAA), measure- 
ments are unlikely to resume for several months. That means 


tracking data will have to be sustained by other NOAA-run sta- 
tions around the world. NASA, meanwhile, canceled develop- 
ment of its troubled GeoCarb mission, designed to track CO, 
levels from geostationary orbit. The mission’s price tag had 
ballooned from $171 million to more than $600 million since 
2016. NASA said new technology, including a hyperspectral 
instrument mounted on the International Space Station this 
year, has created alternatives to GeoCarb and it will begin work 
later this decade on a new greenhouse gas monitoring satellite. 


Antiparasite pill impresses 


NEGLECTED DISEASES | A new drug that 
can treat human African trypanosomiasis— 
commonly known as sleeping sickness—with 
just one dose has shown promise in a 
clinical trial in the Democratic Republic 

of the Congo and Guinea. The rare disease 
is caused by the parasite Trypanosoma 
brucei gambiense, which is transmitted by 
the tsetse fly. Left untreated, it is deadly. 
Until a few years ago, treatment required 
hospitalization and a series of injected 
drugs—difficult to administer in the 
remote regions where the disease occurs. 
In 2019, African nations began using a 
treatment that required 10 daily pills. 
However, the new drug, called acozi- 
borole, could be even simpler. In a trial 
in 208 patients, the one-dose pill cured 
95% of the people treated, even if they 
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were suffering from late-stage disease. 
Researchers are now conducting a placebo- 
controlled trial with 900 participants to 
collect more safety data. 


Avian flu strikes South America 


EPIDEMIOLOGY | Migratory birds in South 
America are now spreading the highly 
pathogenic H5N1 avian flu virus that has 
already led to the deaths of record num- 
bers of poultry and wild birds in Europe 
and North America. Last week, Peru, 
Ecuador, and Venezuela declared agricul- 
tural emergencies and instructed farmers 
to slaughter poultry at infected farms. In 
Colombia, H5N1 was found last month in 
a backyard flock. Galapagos National Park 
is monitoring wild birds for infections. 

In Peru, more than 10,000 pelicans have 
died of the extremely contagious virus. 


Meanwhile, the agriculture ministry in 
Mexico—where commercial farms have 
slaughtered nearly 4 million birds since 
October—announced more than 41 million 
doses of poultry vaccine have been deliv- 
ered to farmers in high-risk regions. 


A shortcut for particle physics 


MATH | For decades, theoretical particle 
physicists have struggled with vexing cal- 
culus problems called Feynman integrals, 
which are central to nearly every calcula- 
tion of how subatomic particles interact. 
Now, theorists in China have found 

a general numerical method to solve 

the integrals much more quickly. The 
approach transforms them into differen- 
tial equations, which can be solved easily 
by a computer—although the method 
depends on a key new insight and lots and 
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lots of linear algebra. Published last week 
in Physical Review Letters, the method 
should be widely useful, including in mak- 
ing predictions to be tested at the world’s 
biggest atom smasher, Europe’s Large 
Hadron Collider. A software package 
deploying the method is already getting 
heavy use, developers say. 


Huge telescope to get underway 


RADIO ASTRONOMY | Construction will 
soon begin on the world’s biggest radio 
telescope, the Square Kilometre Array 
Observatory. Officials at the twin sites 

in South Africa and Western Australia 
announced contracts to install founda- 
tions, power, and fiber connections for the 
131,072 Christmas tree-shaped antennas 
that will detect low frequencies and the 
133 dishes to monitor higher frequencies. 
The dishes will be added to the existing 
64 dishes of South Africa’s MeerKAT array 
in a configuration extending across hun- 
dreds of square kilometers. The €2 billion 
facility, to be completed by the end of the 
decade, will probe the universe’s first stars 
and galaxies, study cosmic magnetism and 
gravity, and listen for alien civilizations. 


Daunting biodiversity talks 


CONSERVATION | Delegates who arrived 
in Montreal this week for a key summit 
meeting on conserving global biodiver- 
sity face a heavy workload. Parties to the 
Convention for Biological Diversity must 
finalize a 261-page document that contains 
some 1800 words and phrases in brack- 

ets because negotiators have yet to reach 
agreement on pollution control targets, 
agricultural reforms, and other issues. The 
meeting is scheduled to end on 19 December. 


Meet a digital Blue Marble 


CLIMATE SCIENCE | Fifty years ago this 
week, astronauts aboard Apollo 17, NASA’s 
last crewed mission to the Moon, took 

an iconic photograph that became known 
as the Blue Marble—the first color photo 
taken by a person that shows Earth’s full 
day side. This week, researchers led by 

the Max Planck Institute for Meteorology 
re-created that image during a test run 

of a cutting-edge digital climate model. 

The model can simulate climatic phenom- 
ena, such as storms and ocean eddies, at 
1-kilometer resolution, as much as 100 times 
sharper than typical global simulations. 
To duplicate the swirling winds of the 
Blue Marble—including a cyclone over 
the Indian Ocean—the researchers fed 
spotty weather records from 1972 into the 
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Aclimate model created this replica of 
the iconic 1972 Blue Marble photo of Earth. 


supercomputer-powered software, which 
can handle time spans of just a few days. 
Researchers say the stunt highlights the 
growing sophistication of high-resolution 
climate models, which are expected to 
form the core of the European Union’s 
Destination Earth project, which aims to 
create a “digital twin” of Earth (Science, 
2 October 2020, p. 16). 


Astone panel 

found in Turkey depicts 

people interacting 

with fearsom e wild animals. = 


igh 


ARCHAEOLOGY 
ANeolithic narrative? 


NASA honors UV researcher 


ATMOSPHERIC SCIENCE | NASA said last 
week it is renaming an atmospheric science 
probe known as GLIDE after the pioneer- 
ing astronomer George Carruthers. The 
Carruthers Geocorona Observatory, set for 
launch in 2025, will study Earth’s upper 
atmosphere, where neutral hydrogen gas 
scatters far-ultraviolet (far-UV) radiation 
from the Sun. Carruthers, who died in 2020, 
pioneered UV astronomy in the 1960s, 
building cameras that were lofted above 
the UV-blocking atmosphere with sound- 
ing rockets to get brief glimpses of the sky. 
The instruments showed that molecular 
hydrogen exists in the voids between stars. 
Carruthers also built a camera that, in 1972, 
was operated on the Moon by Apollo 16 
astronauts, taking the first UV images of 
Earth’s geocorona. He later snapped Halley’s 
Comet in UV and built a camera for the 
Space Shuttle. The decision appears to mark 
the first time NASA has named a space 
mission after a Black scientist. Carruthers 
also worked to promote and mentor young 
Black researchers and students interested 
in careers in science. 


stone panel carved some 10,500 years ago by Neolithic hunter-gatherers in what 

is now southern Turkey is the region's oldest known example of a story told in art, 

an archaeologist claimed this week. Discovered in 2021 in the village of Saybure, 

the 3.7-meter-long panel portrays two human figures encountering wild animals. 

One, with six fingers, crouches beside a sharp-horned bull. The other, flanked by 
two snarling leopards, clutches an erect penis. Researchers caution it is very difficult 
to know whether the ancient artists intended to tell a story—and what it might be. But 
unlike other Neolithic carvings from the region, this panel shows characters interacting 
and appears to have “narrative integrity,” archaeologist Eylem Ozdogan of Istanbul 
University wrote in Antiquity, including “a progressing scene [that] ... suggests that one 
or more related events or stories are being told.” 
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Despite safety issues, an antibody that 
appears to slow Alzheimer’s disease could 
offer hope to patients with few options. 


Alzheimer’s drug stirs excitement—and concerns 


Antibody slows cognitive decline, but deaths, brain bleeds, and swelling mar results 


By Jennifer Couzin-Frankel and Charles Piller 


festive air pervaded an Alzheimer’s 

disease conference last week in San 

Francisco, where attendees cheered 

what several described as a monu- 

mental breakthrough: the first treat- 

ment to clearly, albeit modestly, slow 
the disease’s seemingly inexorable cognitive 
decline. In so doing, some scientists say, it 
substantiated the long-held hypothesis that 
the brain protein known as beta amyloid 
is a major driver of the illness. The news is 
“a celebration for all of us in the field,’ said 
neuroscientist Maria Carrillo, chief science 
officer of the Alzheimer’s Association, at 
the meeting. 

But the buoyant mood was tempered by 
doubts about how well the experimental ther- 
apy, an antibody called lecanemab, worked in 
its pivotal clinical trial—especially in people 
who carry two copies of APOE4, a gene vari- 
ant that predisposes them to Alzheimer’s. 
Safety was also a concern, with scientists 
debating the treatment’s risks of brain swell- 
ing and bleeding. Some have linked the drug 
to the deaths of two trial participants and to 
serious brain injuries in others, although the 
Japanese biotech Eisai Co., lead sponsor of 
the clinical trial, suggested lecanemab was 
not to blame. 

Dennis Selkoe, a Harvard University 
neuroscientist and a longtime supporter of 


1030 9 DECEMBER 2022 * VOL 378 ISSUE 6624 


the amyloid hypothesis, says he has waited 
decades for this moment. “For the first time 
in my career, I see objective evidence” that re- 
ducing amyloid in the brain produces better 
cognitive outcomes. “I’m on the right side of 
history.’ Besides vindicating the amyloid hy- 
pothesis, Selkoe says, the trial results herald a 
promising treatment. We don’t “need perfec- 
tion to offer this to patients.” 

Others strongly disagreed with both con- 
clusions. “I do not think the benefits seen in 
this trial clearly outweigh the risks and de- 
spite the general excitement around a pos- 
sible new treatment, I will be advising my 
patients to keep waiting,” Matthew Schrag, a 
Vanderbilt University physician and neuro- 
scientist, tweeted last week. 

Eisai and its partner Biogen designed 
lecanemab to remove certain forms of beta 
amyloid, which clumps between brain cells 
in people with Alzheimer’s and is widely 
thought to cause the disease’s memory- 
robbing neurodegeneration. Other experi- 
mental therapies, including antibodies, have 
removed amyloids, but patients showed little 
sign of benefiting from those earlier drugs. 

In presentations at last week’s Clinical 
Trials on Alzheimer’s Disease (CTAD) con- 
ference, along with a paper in The New Eng- 
land Journal of Medicine, the companies 
and several researchers expanded on a Sep- 
tember press release that briefly detailed 
the results of the phase 3 lecanemab trial, 


which enrolled 1795 early-stage Alzheimer’s 
patients. The talks and paper confirmed 
that lecanemab, given by intravenous infu- 
sion every other week, on average slowed 
the patients’ rate of cognitive decline by 27% 
after 18 months, compared with people on a 
placebo. That corresponded to a 0.45-point 
separation on the 18-point dementia scale 
used to assess participants. 

It’s a “highly statistically significant” dif- 
ference, said Christopher van Dyck, director 
of the Alzheimer’s Disease Research Unit 
at Yale University and a study leader. The 
antibody also thoroughly mopped up beta 
amyloid, according to brain scans. 

But neurologists disagree on whether the 
reported slowing in cognitive decline would 
be perceptible to many patients or caregiv- 
ers. And they are keen to see whether the 
drug maintains its benefits. “What happens 
in the next 18 months?” David Knopman, 
a neurologist at the Mayo Clinic, said at a 
CTAD panel discussion. “Will the effect ex- 
pand, which is what we hope, will it stabi- 
lize, or will it decline?” 

Knopman and other scientists also point 
out that not all participants appeared to 
benefit equally. Women and people under 
age 65 showed no significant boost from 
the antibody, for example. The subgroups 
analyzed were not large enough to draw firm 
statistical conclusions, Selkoe counters. 

The randomized portion of the lec- 
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anemab trial ended last year, and both the 
placebo and treated groups were then of- 
fered the chance to join an “extension” trial 
in which all received lecanemab. Together, 
the trials are helping bring the therapy’s 
risks into sharper focus. Like other anti- 
bodies that target amyloids, lecanemab 
sometimes produced a form of brain swell- 
ing or bleeding, called amyloid-related im- 
aging abnormalities (ARIA). Routine MRI 
scans showed roughly 21% of those on lec- 
anemab had such side effects, compared 
with just over 9% of those on placebo. Many 
did not notice any symptoms. 

Still, 3.2% of 898 participants on the anti- 
body in the recent trial developed ARIA 
with symptoms, including headache, confu- 
sion, and visual problems. (In the placebo 
group, 0.2% experienced this.) Schrag 
and others fear the bleeding risk is es- 
pecially acute for Alzheimer’s patients 
also taking anticoagulants, which 
interfere with clotting and are often 
prescribed to older people for various 
health problems. (Eisai declined to 
disclose full data on ARIA in partici- 
pants who took anticoagulants while 
on lecanemab.) 

Adding to the concern, two patients 
in the extension trial died following 
brain hemorrhages; both were on anti- 
coagulants. One was a 65-year-old 
physically active woman with minimal 
cognitive decline. In an exclusive inter- 
view with Science recently, the woman’s 
spouse described the catastrophe that 
unfolded after she arrived at North- 
western University Medical Center 
with a suspected stroke. Doctors gave her 
a common stroke treatment, the powerful 
clot-busting medication tissue plasminogen 
activator (tPA). In short order, she suffered 
massive bleeding throughout her brain’s 
outer layer, according to an unpublished 
case report by Northwestern physicians, ob- 
tained by Science. She died a few days later 
without regaining consciousness. 

“There’s zero doubt in my mind that this 
is a treatment-caused illness and death. 
If the patient hadn’t been on lecanemab 
she would be alive today,’ says Rudolph 
Castellani, a Northwestern neuropatho- 
logist who studies Alzheimer’s and con- 
ducted the patient’s autopsy at her hus- 
band’s request. Although tPA does carry a 
risk of brain bleeds, including fatal ones on 
rare occasions, independent experts who 
reviewed the unpublished paper for Science 
said lecanemab likely played a key role in 
the hemorrhage and death. (Castellani, 
whom the husband authorized to speak to 
Science about his wife’s case, says his com- 
ments reflect his personal views, not those 
of Northwestern.) 
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At last week’s meeting, Marwan Sabbagh of 
the Barrow Neurological Institute presented 
further data on the trial’s adverse events, 
including brain “macrohemorrhages” more 
than 1 centimeter in diameter. Such bleeding 
occurred in five out of 140 people—or 3.6%— 
on both lecanemab and anticoagulants, a far 
higher rate than in people only getting the anti- 
body. One of those five was the woman who 
had the stroke; a second was an 87-year-old 
man whose death was reported in October by 
STAT. A third macrohemorrhage patient us- 
ing anticoagulants experienced a devastating 
brain injury, according to a September paper 
by French researchers involved in the trial. 

Besides taking anticoagulants, both 
people who died turned out to have cere- 
bral amyloid angiopathy (CAA), a condition 


Anew antibody targets the protein beta amyloid, which forms 
clumps (orange in this artist’s concept) between brain cells. 


in which amyloid builds up around brain 
blood vessels. In attacking amyloid plaques, 
antiamyloid antibodies also strip away the 
amyloid around blood vessels—and, in do- 
ing so, can weaken them, increasing the 
risk of bleeds. CAA can be hard to diagnose 
before death and afflicts about half of Al- 
zheimer’s patients—particularly those who 
carry two copies of APOEF4. 

At the CTAD conference, Sabbagh dis- 
puted that lecanemab has precipitated fatal 
brain injuries, suggesting the deaths were 
caused by tPA complications in the woman 
and a heart condition in the older man. But 
Nicolas Villain, a neurologist at Sorbonne 
University and an investigator on the lec- 
anemab trial, is more uneasy. He was on 
the French team that described severe brain 
injuries—a macrohemorrhage and a serious 
brain swelling—in two participants on anti- 
coagulants and urges great caution when 
mixing lecanemab and blood thinners. 

Villain especially worries that Alzheim- 
er’s patients who have strokes while on 
lecanemab could die if treated with tPA, 
like the 65-year-old woman. Regulators 


“should take this case report seriously into 

account,” agreed Andreas Charidimou, a 

neuroscientist at Boston University. “When 

there’s so many unknowns it’s better to be 
more conservative.” 

Pinning down the causes of deaths and 
severe side effects in trials can be difficult, 
some scientists stressed. Others argued 
that people with early Alzheimer’s should 
be allowed to make their own choices 
about risks and benefits. “If you ask pa- 
tients what risk they’re willing to take 
with this disease, you may be surprised,” 
Sharon Cohen, a neurologist at the Toronto 
Memory Program and investigator on the 
lecanemab trial, said during a press confer- 
ence last week. 

Amid this debate, lecanemab appears 
on the cusp of being greenlit by the 
U.S. Food and Drug Administra- 
tion (FDA). The agency approved 
aducanumab, another antiamyloid 
antibody, last year on weaker evi- 
dence of cognitive benefits despite 
the complexities and costs of the 
treatment. Such _ therapies, _lec- 
anemab included, must be given in- 
travenously, could require periodic 
brain scans for safety, and are likely 
to cost tens of thousands of dollars a 
year. The agency is expected to rule 
on lecanemab by 6 January 2023. 

If it approves the antibody, FDA 
should at the very least require a 
warning against it being “given con- 
currently with anticoagulation or 
other significant blood thinners,” 
Schrag says. He and some other sci- 

entists worry especially about people with 

two copies of APOE4. In the trial, those 
participants both benefited less from the 
treatment and faced higher risks: 9.2% of 
people with two copies of the gene vari- 
ant showed symptomatic brain swelling, 
compared with 1.4% of people with no cop- 
ies. Other studies of antiamyloid antibod- 
ies have also seen signs of higher risk in 

APOE4 carriers. 

That’s a red flag even to some fans 
of Eisai’s drug. Lecanemab is “truly a 
breakthrough” and “I do look forward to 
prescribing it,’ says Jason Karlawish, co- 
director of the Penn Memory Center at the 
University of Pennsylvania. But, he adds, “I 
will explain the benefits of APOE testing as 
a way to help a patient and family decide 
whether this is the right drug for them.” 

Karlawish says FDA should seek guid- 
ance on lecanemab from an _ advisory 
committee. Currently, no such meeting 
is scheduled. 


Charles Piller's reporting was supported by the 
Science Fund for Investigative Reporting. 
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NASA radar altimetry mission 
to study hidden ocean swirls 


Enhanced resolution of SWOT satellite will highlight how 
small eddies soak up heat and carbon 


By Paul Voosen 


ddies have been overlooked for too 

long. These turbulent swirls of water, 

ranging in size from a few kilometers 

to hundreds of kilometers across, peel 

off large ocean currents and mix heat 

and carbon dioxide into deeper ocean 
layers, like cream stirred into coffee. They are 
the most energetic feature of the ocean, criti- 
cal to getting climate models right—but also 
largely invisible to satellites, except when 
they happen to sweep up a massive bloom of 
green phytoplankton. 

No longer. Eddies and, on land, the ebb 
and flow of rivers and lakes will snap into 
focus after the launch of the Surface Water 
and Ocean Topography (SWOT) satellite, a 
joint venture between NASA and CNES, the 
French space agency. Expected to launch 
no earlier than 15 December from Van- 
denberg Space Force Base in California, on 
a SpaceX Falcon 9 rocket, the $1.2 billion 
satellite carries an altimeter that can mea- 
sure the height of a water surface to within 
a couple centimeters, allowing researchers 
to infer the movements that sculpt it. “The 
change we expect from SWOT will be quite 
dramatic,” says J. Thomas Farrar, a physical 
oceanographer at the Woods Hole Oceano- 
graphic Institution (WHOI). 
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For oceanographers it will be like slip- 
ping on a pair of eyeglasses, says Rosemary 
Morrow, a physical oceanographer at the 
Laboratory of Space, Geophysical, and 
Oceanographic Studies in Toulouse, France. 
The satellite will capture eddies as small as 
7 kilometers across and cover nearly the en- 
tire globe every 21 days. On land, SWOT will 
be able to map the changing height of more 
than 6 million lakes, from the Great Lakes 
down to ponds, while also capturing flows in 
rivers wider than 100 meters. It will replace 
spotty, infrequent measurements from the 
ground and make the field of hydrology far 
more empirical, and global, than it ever has 
been. “It’s going to help us constrain how the 
water cycle works in the Arctic, Africa—places 
where we don’t have on the ground data,” says 
Tamlin Pavelsky, a hydrologist at the Univer- 
sity of North Carolina, Chapel Hill, and co- 
lead of SWOT’s freshwater science team. 

For nearly 4 decades, NASA and CNES 
have launched a series of radar altimeter 
satellites, which use reflected pulses of radar 
to measure water height. Those instruments 
have monitored the accelerating rise of global 
sea levels, a basic indicator of climate change. 
By measuring the ocean’s bulges and dimples, 
they also track the large-scale currents that 
sweep water around the planet. But the sat- 
ellites’ coarse spatial resolution meant rivers 


A 2019 image of the Baltic Sea captured blooms 
of green algae caught up in whirling eddies. 


and small eddies were out of reach. 

SWOT gains a sharper view with the help 
of two 5-meter booms, each bearing an an- 
tenna to catch reflections of the radar signal 
SWOT pulses to Earth’s surface. The widely 
separated antennas give SWOT the resolu- 
tion to measure the height of a patch of water 
just kilometers wide, rather than hundreds of 
kilometers, bringing small eddies into view. 

Armed with the precise observations, hydro- 
logists will be able to say how lakes and riv- 
ers change seasonally, and how short-term 
climatic drivers, such as El Nino, affect those 
rhythms. For marine ecologists, SWOT will 
be able to chart how the levels of the world’s 
major rivers drop each time a dam or weir 
interrupts them, and how severely that frag- 
ments aquatic habitats. It will also see the 
ripples that betray a river’s shallows and deep 
pools, a boon to studies of how rivers evolve. 
And SWOT will capture flood waters as they 
move down river, which should help flood 
modelers, although the measurements won't 
be fast enough to help communities prepare. 

Although SWOT is supposed to oper- 
ate for just 3 years, its science team plans 
to look for correlations between the water 
flows it detects and features the ongoing 
Landsat missions see in visible light, such 
as changing lake and river widths. Those 
visible changes could serve as proxies for 
water levels, allowing researchers to con- 
tinue to keep tabs on the planet’s flows, 
Pavelsky says. “Even once SWOT is gone, 
you can still continue that analysis.” 

SWOT’s view of eddies may be its biggest 
payoff. For example, it will test predictions 
that thousands of small eddies stir the ocean 
at any one time, says Sylvia Cole, a physical 
oceanographer at WHOI. Eddies just a few 
kilometers across likely play a critical role 
in stirring heat and carbon into the oceans 
near the poles, Morrow says. They also drive 
the mixing in smaller seas, she says. “We're 
probably underestimating the energy in the 
Mediterranean by 90% because we’re miss- 
ing these smaller scale structures.” 

At the coastlines, SWOT will provide a de- 
tailed picture of how hot spots of sea level 
rise in the open ocean influence coastal in- 
undation, says SOnke Dangendorf, a physi- 
cal oceanographer at Tulane University. It 
will also study another potential threat 
to coasts: small eddies that might warm 
nearby waters, fueling stronger hurricanes, 
Morrow says. “We're trapping more heat not 
just at the surface, but also at depth.” These 
are important questions to answer, fast, as 
much of humanity lives along coasts, she 
adds. “Everything is felt more keenly in the 
coastal zone.” 
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SCIENTIFIC INTEGRITY 


Image problems besiege Stanford president 


Marc Tessier-Lavigne’s early papers are subject of school and journal investigations 


By Jocelyn Kaiser 


llegations of scientific image ma- 

nipulation are threatening Stanford 

University President Marc Tessier- 

Lavigne, a neurobiologist and former 

biotech leader. Tessier-Lavigne came 

under fire last week after an inves- 
tigation by the school’s student newspaper 
revived long-standing concerns about sev- 
eral publications on which he was a co- 
author. In response, Stanford announced a 
misconduct investigation centering on “cer- 
tain scientific articles,” including two pa- 
pers in Science and one in Cell dating back 
more than 20 years. 

The inquiry has rattled the field of neuro- 
science, where Tessier-Lavigne is a respected 
figure, as well as the broader academic com- 
munity, where some question Stanford’s han- 
dling of the matter. The university’s board of 
trustees, on which Tessier-Lavigne sits, des- 
ignated five other members to lead the in- 
quiry. One stepped aside after the paper, The 
Stanford Daily, revealed his investment firm 
has a $18 million stake in Denali Therapeu- 
tics, a company Tessier-Lavigne cofounded. 

The initial controversy triggered scrutiny 
of other papers Tessier-Lavigne co-authored 
and revealed additional problematic images. 
Image manipulation experts say at least some 
of the errors appear minor. Still, Tessier- 
Lavigne should step aside during the probe, 
one observer suggests. “The in-house investi- 
gation will have more credibility,’ says Rich- 
ard Smith, a former editor of The BMJ. “And 
his position as president is compromised 
when facing these serious accusations.” 

In a 5 December letter to faculty, Tessier- 
Lavigne said, “I welcome” the review and 
that “the integrity of my work is of para- 
mount importance to me, and I take any 
concerns that are expressed very seriously.” 
Although he took responsibility for any paper 
on which he was involved, Tessier-Lavigne 
noted that he was senior author on just the 
three papers in Science and Cell and that 
flagged images in papers in other journals, 
such as Nature and The EMBO Journal, for 
example, were prepared by collaborators’ 
labs. Indeed, co-authors elsewhere have al- 
ready taken responsibility for errors in vari- 
ous papers. 

Some outsiders think Tessier-Lavigne’s 
scientific reputation will remain intact. “A 
lot of important and very solid work came 
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out of Marc’s lab, as well as inevitably some 
studies that seemed a little overhyped. I was 
relieved when I clicked on the links to see 
that it was not some of Marc’s more semi- 
nal papers that were attracting such scru- 
tiny,” says neurobiologist Barry Dickson of 
the Queensland Brain Institute. The Science 
and Cell papers “comprise a rather small 
fraction of the overall body of work ... for 
which Tessier-Lavigne is well known and 
widely admired,” adds Harvard University 
neurobiologist Joshua Sanes. 
Tessier-Lavigne, 62, a former president 
of Rockefeller University who was also 
once chief scientific officer at Genentech, 
is known for groundbreaking research in 


Marc Tessier-Lavigne’s research on guidance cues for 
growing nerve fibers has drawn new scrutiny. 


the 1990s on proteins dubbed netrins. He 
and others showed the molecules govern 
the growth of nerve fibers called axons in 
the developing spinal cord, work for which 
he and two colleagues won the prestigious 
2020 Gruber Prize in Neuroscience. The 
1999 Cell paper and two 2001 Science publi- 
cations now under scrutiny, produced when 
he was at the University of California, San 
Francisco (UCSF), came after his initial 
landmark netrin discoveries. 

Questions about certain images in the pa- 
pers first surfaced 7 years ago on PubPeer, 
an online forum where scientists identify 
and discuss possible problems in published 
research, often anonymously. Some posts 
suggested various Western blots, which 


document a protein’s presence in a sample, 
were repeated in more than one figure or 
altered. In other cases, images or parts of 
images appear to have been cut and pasted 
into other figures. 

Elisabeth Bik, a specialist in spotting ma- 
nipulated images, reexamined the three pa- 
pers and other Tessier-Lavigne publications 
noted in PubPeer at the request of the Daily. 
She calls some image changes “beautifica- 
tion” that did not affect the papers’ conclu- 
sions but describes others as “more serious.” 

Stanford told the Daily it discussed con- 
cerns about the papers with Tessier-Lavigne 
when he was under consideration for univer- 
sity president in 2015. He submitted correc- 
tions for both Science papers, but the journal 
never posted them “due to an error,’ said 
Holden Thorp, editor-in-chief of the Science 
family of journals. Tessier-Lavigne says an- 
other error has since been identified in one 
of the Science papers; Thorp says the Stan- 
ford president has been “very collaborative” 
in discussing possible solutions. 

Tessier-Lavigne also reported errors in the 
Cell paper in 2015, but at the time its edi- 
tors found no correction was necessary, the 
publisher Cell Press said in a statement. Last 
week, Bik found another manipulated image 
in that paper, which she says appears to be 
“deliberately changing the results” and could 
indicate data falsification. Cell Press said the 
“new concerns ... warrant a closer look” and 
it now plans to investigate. The only authors 
on all three challenged papers are Tessier- 
Lavigne and a postdoc then at UCSF who ap- 
parently hasn’t published a paper since 2012. 
She did not respond to emails from Science. 

Some scientists pointed out that regard- 
less of the source of the alterations in the 
three papers, Tessier-Lavigne was the leader 
of the work. He was “responsible for the in- 
tegrity of the data,’ Bik says. All problematic 
papers that Tessier-Lavigne co-authored 
should be reviewed, she adds. “This investi- 
gation should be much wider [involving] all 
his collaborators’ institutions.” 

Tessier-Lavigne’s troubles are a warning 
to investigators everywhere to keep an eye 
on how lab members edit images of data, 
some say. “In this day and age, it’s very easy 
to manipulate digital images. There’s a lot 
of trust built into science and trust can be 
abused,” says neurobiologist Tim Kennedy 
of McGill University. “On the flip side, sci- 
ence is a self-correcting enterprise.” 
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COVID-19 


In China, “zero COVID’ has become a Catch-22 


Population chafes at control measures, but lifting them now would carry huge risks 


By Dennis Normile 


urprised and stung by protests against 
draconian “zero-COVID” policies, 
Chinese authorities are gingerly mov- 
ing to ease the burden of lockdowns, 
quarantines, and constant testing. 
But 3 years into the pandemic, China 
shows no sign of planning a major course 
change. Mathematical models suggest why: 
The country is still ill-prepared for living 
with SARS-CoV-2. Easing restrictions today 


will be allowed to isolate at home instead 
of being sent to quarantine centers. Those 
measures are “a small but important step, 
and I believe more steps will follow,’ says 
a Chinese scientist who asked to remain 
anonymous. “This will alleviate some of the 
pandemic fatigue and reduce the burden on 
the economy and society.” 

But models show why the Chinese gov- 
ernment still wants to keep a lid on. A 
study based on vaccination rates in March, 
published in Nature Medicine in May, 


People in Beijing hold up empty sheets of paper to protest China’s COVID-19 restrictions on 27 November. 


would likely trigger a massive wave of infec- 
tions, overwhelm health care facilities, and 
bring a high death toll. 

“China has not achieved high vaccination 
rates, has not used the best type of vaccines, 
and it has been very slow in communicat- 
ing [to the public] the eventual need to 
transition from elimination to suppression 
and mitigation,” says public health scientist 
Nick Wilson of the University of Otago, Wel- 
lington, in New Zealand. Other countries 
that initially followed the zero-COVID strat- 
egy, including New Zealand, used it to buy 
time to ramp up vaccination rates, stockpile 
antivirals, and boost intensive care capacity. 

The fierce protests have triggered some 
changes. Several provinces have started to 
allow people to enter public transportation, 
restaurants, and shopping centers without 
proof of a negative COVID-19 test, for ex- 
ample, and some close contacts of patients 
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found lifting zero-COVID restrictions at 
that point could “generate a tsunami of 
COVID-19 cases” over a 6-month period, 
with 112 million symptomatic cases, 2.7 mil- 
lion intensive care unit (ICU) admissions, 
and 1.6 million deaths. Peak demand for 
ICU beds would hit 1 million, more than 
15 times the current capacity. 

The unvaccinated would account for 77% 
of the fatalities, according to the authors, 
primarily at Fudan University. Boosting 
vaccination rates could slash the toll, but 
China’s elderly population has remained 
wary of vaccination. Even today, only 66% 
of those ages 80 and older have received 
two doses—versus 90% of the population as 
a whole—and just 40% have taken boosters. 

Hong Kong provides a cautionary tale: 
A big Omicron outbreak early this year 
caused nearly 6000 deaths, 96% of them 
in people 60 or older. At the time, Hong 


Kong’s vaccination rates were even lower 
than the mainland’. In the first 3 months 
of this year, the city had a COVID-19 death 
rate of 37.7 per million population, among 
the highest anywhere. 

A new study released on 28 November 
by Airfinity, a London-based health analyt- 
ics firm, suggests mainland China remains 
vulnerable. Lifting zero COVID now could 
cause between 167 million and 279 million 
cases and between 1.3 million and 2.1 million 
deaths over 83 days, according to the report. 

Besides being underused, China’s 
vaccines—which contain inactivated 
virus—are less effective than the messenger 
RNA (mRNA) vaccines available elsewhere. 
A study in Hong Kong, where people could 
choose between the Pfizer-BioNTech mRNA 
vaccine or CoronaVac, a Chinese vaccine, 
showed three shots of either were more 
than 90% effective in preventing serious ill- 
ness and death among those older than 60. 
But two doses of the mRNA vaccine were 
significantly more effective than two shots 
of CoronaVac. A separate study suggested 
CoronaVac’s protection wanes more rapidly. 

In May 2021, BioNTech and China’s 
Shanghai Fosun Pharmaceutical agreed to 
set up a 50-50 joint venture to make and 
sell BioNTech’s shot in China. But the prod- 
uct never received regulatory approval, 
perhaps to protect China’s homegrown vac- 
cines. Four Chinese companies have mRNA 
vaccines in various stages of development, 
but none has received the green light either. 
Pfizer’s COVID-19 drug Paxlovid is available 
in China, as is a locally produced monoclo- 
nal antibody therapy, but it’s unclear how 
many doses the country has. 

Mainland authorities are still hoping to 
boost vaccination rates among the elderly. 
A 29 November directive of the National 
Health Commission urged local authorities 
to set up teams to go door to door and as- 
sess whether the ill and infirm can be safely 
vaccinated, and to launch education cam- 
paigns to counter vaccine hesitancy. 

But those efforts will take time, says Xi 
Chen, a public health scientist at the Yale 
School of Public Health. “Reopening too 
soon will crowd out resources, crush the 
medical system, and cause more deaths,” he 
says. For now, China remains trapped in the 
Catch-22 of its own making. 


With reporting by Bian Huihui. 
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New U.S. law aims to light up 
medical research on cannabis 


Biden signs bill streamlining pot studies and production 


By Meredith Wadman 


t just got easier for U.S. scientists to get 
their hands on some pot—for research, 
that is. President Joe Biden last week 
signed into law a bill that streamlines ac- 
cess to marijuana for medical research. 
The new law is expected to speed the is- 
suance of government permits to scientists 
who want to study cannabis, whose medici- 
nal promise has been widely touted but re- 
mains, with a few exceptions, unproved. It 
will also expedite permits for producers— 
including universities—that want to grow 
and distribute the drug for research. And the 
measure obliges the federal government to 
make sure an adequate, uninterrupted sup- 
ply of marijuana is available to scientists. 

“We will now be able to treat marijuana 
like we treat any other substance or pharma- 
ceutical for which we hope there is potential 
benefit. We will be able to subject it to rig- 
orous scientific trial,” says Representative 
Andy Harris (R-MD), a physician and former 
National Institutes of Health (NIH)-funded 
researcher who helped usher the bipartisan 
legislation through Congress. 

“This is exciting,’ says Ziva Cooper, direc- 
tor of the Center for Cannabis and Canna- 
binoids at the University of California (UC), 
Los Angeles. “The bill is a significant step 
forward with respect to chipping away at 
the barriers” to research. Scientists are eager 
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to study cannabis and its derivatives as po- 
tential treatments for cancer, chronic pain, 
stress disorders, and other conditions. 

Other cannabis researchers welcomed the 
law but said it doesn’t go far enough. Some 
are disappointed it does not include a provi- 
sion from an earlier draft of the legislation 
that would have allowed scientists to buy and 
study the marijuana available to consumers 
in the 37 states that have legalized its recre- 
ational or medical use, or both. 

“There’s no substitute for studying real- 
world products that our patients and rec- 
reational consumers are using,” says Staci 
Gruber, a neuroscientist who runs trials of 
cannabis and its compounds as potential 
treatments for several conditions at McLean 
Hospital in Belmont, Massachusetts. 

Cannabis is classified as a Schedule I 
drug—like heroin and LSD—under U.S. crim- 
inal law. That means both scientists and the 
growers who supply the drug for research 
must obtain permits from the Drug Enforce- 
ment Administration (DEA) and follow strict 
rules for storing and handling the drug. Re- 
searchers say it can take DEA a year or more 
to respond to permit applications. 

The new law stipulates that within 60 days 
of receiving a researcher’s application, the 
agency must approve it, request more infor- 
mation, or say why it is denying a permit. If 
a researcher submits more information upon 
DEAss request, the agency then has 30 days 


Scientist Suman Chandra checks cannabis plants at 
a growing facility at the University of Mississippi. 


to make a decision. The law also makes it 
easier for researchers to amend their re- 
search protocols midstream, and ensures 
that DEA can’t demand stricter security 
measures for marijuana than for other 
Schedule I drugs. 

“The great advance of this bill is it puts 
a deadline on the [DEA’s] response time,” 
says Larry Walker, a pharmacologist at the 
University of Mississippi who has applied 
for research registrations. 

Walker is the former director of a Univer- 
sity of Mississippi center that, for decades, 
was the sole grower permitted by DEA to 
supply cannabis to U.S. scientists. But re- 
searchers complained that the center’s pot 
was not nearly as potent as what is now com- 
mercially available. Since last year, DEA has 
registered six more growers, all of them com- 
panies, to produce marijuana for research. 

Groff North America in Red Lion, Penn- 
sylvania, is one of the newly licensed grow- 
ers. Steven Groff, its founder and chief 
medical officer, dismisses complaints from 
scientists that DEA-registered companies 
can’t provide cannabis products that mir- 
ror products widely available in legal dis- 
pensaries. “That myth is over,’ he says, 
noting his firm supplies smokable mari- 
juana flower, vaping products, edibles, bev- 
erages, and soft chew tablets (gummies) 
to major universities. “We are going to be 
able to easily provide those materials now 
to researchers,” he says. 

Researchers can at least count on an 
ample supply going forward. The new law 
requires the attorney general to make an 
annual report to Congress on whether the 
DEA-controlled marijuana supply is un- 
interrupted and adequate for research needs, 
and to describe and remedy any shortfalls. 
And it orders NIH to produce a report within 
1 year that addresses, among other things, 
how barriers to studying marijuana in states 
that have legalized pot might be overcome. 

In the meantime, many researchers 
would like to see the federal government re- 
move marijuana’s Schedule I classification. 
That listing suggests cannabis is “uniquely 
harmful, and ... medically unproven,” even 
though U.S. regulators have approved a 
number of cannabis-based therapies, says 
Igor Grant, director of the Center for Me- 
dicinal Cannabis Research at UC San Diego. 

In October, Biden ordered the attorney 
general to consider reclassifying canna- 
bis. If it were moved down to Schedule II, 
pot would join a number of other approved 
drugs on that list, including morphine and 
Adderall, a treatment for attention deficit 
hyperactivity disorder. 
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HERO CITY 


Kharkiv was Ukraine’s science hotbed until Russia attacked. 
The crippled city refuses to give up 


n an SUV pockmarked by shrapnel, 
Mykola Shulga wends his way along 
the Kharkiv highway, dodging con- 
crete barriers and antitank obstacles 
scattered along the road like giant toy 
jacks. On the northern outskirts of 
his broken city, he reaches the ruins 
of Pyatykhatky, an academic enclave 
set amid oak and maple groves. “This 
was one of the most beautiful areas of 
Kharkiv,” he says, on a brisk October day. 
Before the war, it was home to many re- 
searchers at Ukraine’s largest science cen- 
ter, the Kharkiv Institute of Physics and 
Technology (KIPT). But months of shelling 
left some of the district’s apartment blocks 
uninhabitable. Shulga, a theoretical physi- 
cist and KIPT’s director-general, gets quiet 
as the SUV passes a rubble pile where Py- 
atykhatky’s hospital once stood. “They also 
destroyed the primary school,” he says. 

Russia’s bloody war on Ukraine, now 
in its 10th month, has killed or injured at 
least 17,000 civilians, including scientists 
and students. It has also 
displaced more than 
14 million, according to the 
United Nations. More than 
1300 Ukrainian scientists— 
primarily women and men 
older than 60—have fled, 
finding refuge in labs in 
other countries. Tens of 
thousands of students are » 
studying abroad. 

Nowhere in Ukraine 
has science suffered more 
than here in Kharkiv, an 
academic community that 
Gerson Sher, co-chair of a 
U.S. National Academies of —. 
Sciences, Engineering, and 
Medicine (NASEM) work- 
ing group on rebuilding 
science in Ukraine, calls f 
“the crown jewel of Ukrai- J 
nian science.” In addition 
to KIPT, the Kharkiv region 
hosts nine other institutes 
of the National Academy of & 
Sciences of Ukraine (NASU) 
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By Richard Stone, 
in Kharkiv, Ukraine 


and 65 universities and colleges. Few were 
left unscathed. Major losses include the 
control building for a unique NASU radio 
telescope and thousands of samples from a 
bombed national seed bank. The war has 
claimed 11 lives from V. N. Karazin Kharkiv 
National University, one of Ukraine’s top 
academic centers, and a diminished stu- 
dent body has triggered mandatory layoffs 
of teaching staff. 

Yet Kharkiv is resurgent. In a counter- 
offensive in September, Ukraine’s army 
recaptured nearly the entire Kharkiv 
region—putting the city outside the range 
of Russian artillery fire for the first time in 
months. As winter draws near, academic 
centers here are buzzing with activity as 
staff repair blown-out windows. Some lab 
work and online classes have resumed, even 
amid blackouts and scant heating caused 
by Russian bombardment of the country’s 


Battle lines 

Russian forces occupied Chornobyl and nearly seized Kharkiv 
soon after they invaded Ukraine on 24 February. Since then, the 
Ukrainian army has steadily clawed back territory. But Kharkiv, 
close to the border with Russia, remains vulnerable to missiles. 


@ Russian-controlled Ukrainian territory 


Belarus 


Chornobyl 
¢ Kyiv 
e 
Kharkiv 
Ukraine 


Dnieper River ——e 


Russia 


energy infrastructure. “Our enemy’s plans 
to ruin our academic life were not realized,” 
says Karazin rector Tetyana Kaganovska. 

The resilience has energized efforts to 
staunch Ukraine’s brain drain. Over the 
summer, the European Union launched a 
€25 million program, led by the Alexander 
von Humboldt Foundation, to place Ukrai- 
nian graduate students and postdocs in 
European labs with the expectation that 
they will return home when conditions al- 
low. A host of smaller programs are help- 
ing scientists stay in Ukraine. 

Alongside those emergency efforts, the 
European Union, the United States, and 
other allies have started discussions with 
Ukrainian counterparts on what a Mar- 
shall Plan for Ukrainian science might look 
like—if funding were to materialize. This 
month, the European Union is expected to 
approve a program in which DESY, Germa- 
ny’s largest accelerator center, will spear- 
head an effort to map Ukraine’s science 
landscape, says DESY’s Martin Sandhop. 
The exercise would inform 
any future makeover of 
Ukraine’s many obsolete 
Soviet-era facilities and 
practices. “Ukraine needs 
to rebuild the system, not 
just the buildings,” says 
Yury Gogotsi, a Ukraine- 
born materials chemist at 
Drexel University. 

First, though, Ukraine 
must win the war—and 
many look to Kharkiv for 
inspiration. A billboard on 
the Kharkiv highway cap- 
tures that sentiment: Misto 
Heroit Kharkiv, or “Hero 
City Kharkiv.” That title 
recalls the honorific So- 
viet leaders bestowed on a 
dozen cities that withstood 
Nazi attacks during World 
War II, including four in 
Ukraine. Kharkiv did not 
make that list, but today it 
epitomizes Ukraine’s resis- 
tance to Russian aggression. 
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“Our colleagues in Kharkiv sincerely be- 
lieve in victory, that their city will stand 
despite all the troubles. These are not fake 
emotions,” says NASU President Anatoly 
Zagorodny, who visited Kharkiv in August. 
But the city is down to half its prewar 
population of 1.4 million and is still in easy 
range of Russian missiles. Even Kharkiv’s 
most ardent admirers acknowledge it will 
be a challenge to lure researchers and 
students back to rebuild the city’s once- 
vibrant science scene. 


IN THE FIRST DAYS of the war, Russia un- 
leashed a blitzkrieg on KIPT, perhaps 
because of suspicions that the institute har- 
bored a secret nuclear weapons program. 
In a former incarnation a half-century 


After Ukraine became independent in 
1992, KIPT emerged from the nuclear shad- 
ows as Ukraine’s first National Research 
Center. It struggled to fund science amid 
the economic tumult and government cor- 
ruption of the 1990s. But in a deal the U.S. 
government helped broker, from 2010 to 
2012 KIPT and two other Ukrainian entities 
relinquished their weapons-grade, highly 
enriched uranium, shipping 234 kilograms 
of it to Russia, which blended it down to 
low-enriched uranium (LEU). In exchange, 
the U.S. Department of Energy promised 
to provide Ukraine with LEU fuel for 
power reactors, a program that KIPT now 
oversees. The United States also provided 
KIPT with funds and expertise to build a 
$90 million experimental reactor. 


At the Institute for Single Crystals, a technician prepares samples in an ultra—low-humidity room. 


ago, KIPT was in fact a weapons lab; it 
played a leading role in developing the 
Soviet bomb. The storied institute was 
founded in 1928 as the Ukrainian Institute 
of Physics and Technology by Abram Ioffe, 
widely regarded as the father of Soviet 
physics. Renowned theoretical physicist 
Lev Landau came a few years later, and 
the duo assembled a physics dream team, 
including the likes of George Gamow, a 
polymath who discovered the quantum 
tunneling mechanism for radioactive al- 
pha decay and developed the big bang 
theory of our universe’s origins. By 1946, 
Igor Kurchatov, architect of the Soviet nu- 
clear weapons program, had anointed the 
institute “Laboratory No. 1” and closed its 
doors to the outside world. 
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This facility, called the Neutron Source, 
would generate medical isotopes and 
beams of neutrons for probing materials. 
It would also serve as a prototype for sub- 
critical nuclear power reactors that, un- 
like existing plants, do not sustain a chain 
reaction in their core. In August 2021, 
KIPT completed the reactor itself and a 
100-megaelectronvolt accelerator that fires 
neutrons into the reactor to maintain fis- 
sion reactions. 

KIPT had just finished loading LEU fuel 
into the core and was ramping it up to a 
feeble initial power level of 20 kilowatts 
when war broke out. On 24 February, KIPT 
hurriedly shut down the core as explosions 
reverberated outside. In March and again 
in June, missiles and artillery shells dam- 


aged the building housing it and laid waste 
to its power supply, Shulga says. But the re- 
actor vessel and its uranium fuel rods were 
unscathed. “That was a miracle,” he says. 

Elsewhere at KIPT, missiles blasted 
holes in buildings, blew out windows, 
gouged craters, and demolished the roof 
of its solid-state physics lab. After an 
International Atomic Energy Agency 
(IAEA) team visited last month, Direc- 
tor General Rafael Mariano Grossi called 
the damage “dramatic and shocking, even 
worse than expected.” 

The withering attacks hollowed out the 
institute: About half of its 2000 staff left 
Kharkiv, including dozens of elite scien- 
tists who found refuge in safer areas in 
Ukraine and 30 other countries. Seven are 
now at DESY. “Before the war, KIPT 
wasn’t even on our radar screen,” 
says DESY particle physicist Frank 
Lehner. DESY has since discovered 
KIPT’s “extremely strong” work in 
theoretical physics, he says. KIPT’s 
remaining staff are trying to sus- 
tain a minimal research program 
and repair salvageable buildings be- 
fore winter. 


CLOSER TO THE CITY CENTER, another 
NASU stronghold took heavy fire. 
Established in 1955, the Institute 
for Single Crystals (ISC) may be 
best known outside Ukraine for 
producing the lead-tungstate crys- 
tals and plastic scintillators used 
in particle detectors at the Large 
Hadron Collider in Switzerland 
(which discovered the Higgs bo- 
son) and the cesium iodide crystal 
detectors used in the Belle experi- 
ment in Japan (which observed 
asymmetries between matter and 
antimatter known as charge-parity 
violations). “Our materials helped 
win two Nobel Prizes,” says ISC Di- 
rector General Volodymyr Semynozhenko. 
ISC also exports industrial-grade sap- 
phires and other specialty crystals, and 
it has diversified into chemical reagents 
and pharmaceuticals. 

Until September, Valentin Chebanov, 
ISC’s deputy director, camped out in 
his office, believing it was safer than his 
apartment downtown. The institute also 
converted a vast subbasement in its main 
building into a shelter with cots and a 
cooking area. Some 200 staff and their 
families holed up there for months. Brac- 
ing for a rough winter, ISC is installing in- 
ternet and making other improvements to 
the shelter. 

Shelling damaged ISC, although not 
nearly as extensively as KIPT. ISC staff 
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The volunteer 


Iga Shpak made headlines around the world in 2017, when 
she captured the first footage of a pod of orcas using 
teamwork to hunt a young bowhead whale in the Sea of 
Okhotsk. The drone video was a sensational moment in her 
2 decades of methodical fieldwork investigating bowheads, 
belugas, and other whale species—studies that had thrust 
the Ukrainian marine mammologist into the upper echelon of 
cetacean researchers and earned her a position at the A.N. Sev- 
ertsov Institute of Ecology and Evolution in Moscow. “| loved my 
work. | could not imagine doing anything else,” she says. Then 


the Russians stormed her homeland. 


In February, as the war drums grew louder, Shpak took an 
overnight train from Moscow to Belgorod, near the Ukraine 
border, where her brother picked her up on the morning of the 
23rd and drove her to their native Kharkiv. The very next day, the 
city was under siege. Shpak joined a local volunteer group and 
threw herself into sourcing medicine for civilians, and clothing 
and other supplies for the army. Thanks to her fluent English, 
she also serves as a liaison between Kharkiv’s volunteers and 


international aid groups. 


Several Russian colleagues emailed Shpak, expressing their 
shame over the war. “They cannot speak the truth in public,” she 
says. She misses whales but has no time to think about science. 
“| don't know if | will ever work as a Scientist again,” she says. 
“My priority now is people. My fellow Ukrainians. This was a 


great discovery for me.” —R.S. 


piled sandbags in front of windows in 
rooms with pricey instruments. But an 
electrical surge from the unstable grid 
fried the circuit board of a nuclear mag- 
netic resonance machine, used for de- 
termining crystal structures. “You can 
imagine, no one wants to come to Kharkiv 
to fix it,’ Chebanov says. ISC researchers 
now must send samples to Kyiv—a 3- or 
4-day turnaround. And they face chronic 
shortages of research-grade argon, helium, 
and other gases after a local supplier was 
bombed. 


SOME 60 KILOMETERS southeast of Kharkiv, 
at the Semen Braude Radio Astronomy 
Observatory, a unique instrument is in 
shambles. The Ukrainian T-Shaped Radio 
Telescope, second modification (UTR-2) is 
the world’s largest radio telescope tuned 
to decameter wavelengths, low-frequency 
emissions that hold clues to pulsars, the 
composition of the interstellar medium, 
and other phenomena. “They are pio- 
neers of the lowest radio frequencies,” says 
Philippe Zarka of the Paris Observatory. 
Russian forces seized the Braude ob- 
servatory on 25 February and used the 
140-hectare grounds as a military base, 
says Vyacheslav Zakharenko, director of 
NASU’s Institute of Radio Astronomy (IRA) 
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on whales to spend her days sourcing supplies for Ukrainian soldiers. 


in Kharkiv, which runs the observatory. Af- 
ter Ukrainian troops liberated it in early 
September, IRA staff found no damage to 
UTR-2’s 2040 small dipole antennas, ar- 
ranged in two arms extending 900 meters 
and 1.8 kilometers. But what they saw at 
the main building “was heartbreaking,” 
Zakharenko says. Russian troops had 
looted or smashed instruments and com- 
puters and its roof was caved in. They’d 
also laid mines, including some in the ser- 
vice tunnels connecting the antennas to 
the central building. 

Zakharenko’s team has moved salvage- 
able equipment into intact buildings 
before winter sets in. Replacing the elec- 
tronics in UTR-2’s control room “would 
be fast and cheap,” says Michel Tagger, an 
astrophysicist at CNRS, the French na- 
tional science agency, in Orleans. “That 
would make sense. It would keep the sci- 
entists there working,’ Zarka adds. But 
UTR-2 is 50 years old and becoming “more 
and more obsolete,” he says. 

IRA’s long-term prospects may hinge on 
completing a successor to UTR-2 called the 
Giant Ukrainian Radio Telescope (GURT). 
It could detect distant exoplanets in a new 
way, by looking for the low-frequency radio 
flares whipped up when planetary mag- 
netic fields interact with those of their host 


stars. By 24 February, IRA had installed 
five of GURT’s 100 planned antenna ar- 
rays. When fully deployed, GURT would be 
10 times more sensitive than UTR-2. 
Tagger calls for a concerted European 
effort to help complete GURT and link it 
to the Low-Frequency Array (LOFAR), a ra- 
dio telescope network in seven European 
countries that he and Zarka helped build. 
“You don’t want to see 50 years of experi- 
ence in radio astronomy in Ukraine wiped 
out,” Zarka says. “Welcoming them into 
LOFAR would be a very good move.” 


FOUNDED IN 1804, Karazin is one of the old- 
est universities in Eastern Europe. Before 
the war, it had 23,000 students, includ- 
ing 4500 from overseas primarily study- 
ing medicine. When Kharkiv came under 
fire, about one-quarter of Karazin’s facili- 
ties were destroyed, including schools for 
physics, business, economics, and public 
administration, as well as three dorms 
and a new sports complex. Nearly all the 
foreign students fled, as did many Ukrai- 
nian students. Scores of students and fac- 
ulty joined the militia or volunteer units. 
Six teachers and five students are known 
to have perished on the battlefield or in 
shelling of civilian areas. “Every life lost is 
a profound grief for us,” Kaganovska says. 
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Others stayed to protect the university. 
After a missile attack in March fractured a 
pipe and began to flood Karazin’s archaeo- 
logy museum, staff dodged artillery fire 
to come to campus and save collections 
from water damage. Faculty and students 
also rescued herbarium samples after the 
central heating system went down and 
salvaged instruments from the School of 
Physics and Technology in Pyatykhatky. 

Karazin’s classes went fully online in 
March. It has also struck deals with uni- 
versities in western Ukraine and in other 
European nations to host hundreds of its 
students as they earn Karazin degrees. 
That has helped Karazin hold onto about 
85% of its Ukrainian students. “Preserving 


their education is the most important way 
we can help our country,’ Kaganovska says. 
But the university faces financial woes. Be- 
cause the central government pegs university 
budgets to the number of enrolled Ukrainian 
students, it reduced Karazin’s funding for the 
current academic year by 15%. Karazin also 
lost tuition revenue from foreign students. 
Forced to make painful cuts, Kaganovska, 
a law professor, did not renew contracts for 
48 faculty members in the School of Physics, 
including its chair. Other departments fared 
better, which “caused enormous outrage,’ 
says Eugene Chudnovsky, a physicist at the 
City University of New York who grew up in 
Kharkiv and earned his Ph.D. from Karazin. 
The university, he says, “has always been a 


SCIENTISTS AT WAR 


The caretaker 


uliia Guglya says it was always a dream to live where she works. That dream has 
now come true—but as part of a nightmare. In her work as an entomologist, Guglya 
had spent countless hours in the collections of the Museum of Nature at V. N. Karazin 
Kharkiv National University. But when Russia invaded Ukraine on 24 February, she 
became its resident curator. She, her husband, and their 21-year-old son moved into a 
warren of rooms in the museum's basement, where she has stashed some of the 215-year- 


old collection’s prize holdings. 


Guglya’s first love was butterflies, before her graduate adviser implored her to switch 
to leaf miner flies, which no one at the time was studying in Ukraine, she says. “| was 
so sad. | thought flies were disgusting.” Guglya came around to them and has discov- 
ered 25 species in the leaf miner family. Many more remain to be described among her 
40,000 specimens, she says. “I couldn't evacuate my collection. And anyway, | didn’t 
want to escape.” For weeks, Guglya, museum director Rostyslav Luniachek, and a few 
colleagues worked to move the most coveted specimens to the basement. They then 
turned to boarding up the 200 or so windows blown out by bomb blasts. “We're lucky,” 


she says. “My husband's a carpenter.” 


Guglya doesn't know when she and her family—her parents-in-law joined them in April— 
will vacate the museum. Perhaps sensing her distress, a young black cat, named Javelin— 
after the U.S. shoulder-fired antitank rocket—ambles over to her makeshift desk and plops 
onto her lap. “He helps us to be here,’ she says, stroking Javelin’s fur. But she does not 
regret her choice. “We are saving the museum. And the museum is protecting us.” —R.S. 


Yuliia Guglya moved her specimens to safety in the Museum of Nature’s basement. 


1040 9 DECEMBER 2022 + VOL 378 ISSUE 6624 


major supplier of scientists to Kharkiv’s re- 
search labs.” 

Kaganovska says the student-teacher ratio 
has been unusually low in Karazin’s physi- 
cal sciences departments, compelling the 
university to tap reserves to cover salaries— 
money the war has largely drained. She re- 
instated the laid-off physicists last month, 
after a U.S. donor stepped up to cover part 
of the shortfall. “We are looking for the rest 
now,’ Kaganovska says. “Definitely our mis- 
sion is preserving science.” 

When peace atrives, Karazin is counting 
on far-flung faculty and students to return 
to campus. Several universities here, includ- 
ing the University of Biotechnology, have 
pulled up stakes and now operate in west- 
ern Ukraine. Kharkiv can’t afford further 
losses, Chudnovsky says. “The future of 
Kharkiv will very much depend on educa- 
tion in Kharkiv.” 


AS THE WAR GRINDS ON, scientists across 
Ukraine are looking beyond a harsh winter 
of power cuts to an expected victory—and 
the prospect of rebuilding tattered insti- 
tutes and universities. 

An overriding priority is preventing 
a further loss of talent. A mad scramble 
in the spring to place refugee-scientists 
in labs outside Ukraine has now evolved 
into a widespread effort to help more than 
50,000 scientists who have stayed. Many 
are men under the age of 60 who are for- 
bidden from crossing the border without 
explicit permission, in case they are needed 
for the military. (For now, scientists and 
students are exempt from service.) Thou- 
sands of women scientists, too, have stayed 
to care for family members or serve in the 
military or as volunteers. 

At least three dozen assistance programs 
in Europe and the United States are now 
throwing lifelines to scientists in Ukraine. 
Many are small efforts, targeted to spe- 
cific disciplines. For instance, an initiative 
of the Institute of Electrical and Electron- 
ics Engineers, a large U.S. society, funded 
22 Ukrainian proposals on magnetism to the 
tune of $180,000 this year and plans to spend 
another $100,000 on new projects in 2023. 

But wartime conditions hamper re- 
search. “You can’t do science without 
money. Without electricity,’ says Jerzy 
Duszynski, president of the Polish Acad- 
emy of Sciences (PAN). So NASEM and PAN 
are testing an innovative strategy. With 
$3 million from the Breakthrough Prize 
Foundation, they launched a program in 
October that will allow up to 30 groups led 
by elite scientists in Ukraine to spend up to 
3 months a year doing hands-on research 
in the labs of Polish collaborators—and pay 
their salaries for the remainder of the year 
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Dipole antennas at a pioneering radio telescope outside Kharkiv, Ukraine, were not damaged during Russia’s monthslong occupation. 


in Ukraine. Such a lifeline, Duszynski says, 
“will help keep Ukrainian science alive.” 
Over time, Duszynski hopes the elite teams 
will nucleate Ukraine’s scientific revival, 
by strengthening their connections to top 
institutions all over the world. 

For KIPT, German institutes are emerg- 
ing as potential saviors. DESY aims to 
bring KIPT scientists into a new quantum 
electrodynamics experiment at the Euro- 
pean X-ray Free-Electron Laser, a facility 
that has suspended collaborations with 
Russia. And the GSI Helmholtz Centre for 
Heavy Ion Research in Darmstadt, Ger- 
many, is looking to involve KIPT in FAIR, 
a particle accelerator under construc- 
tion that would use high-intensity beams 
to create matter that now only exists in 
the universe’s harshest environments, such 
as supernovae. 

Embedding Ukrainian scientists in Eu- 
rope’s premier science facilities makes 
sense as Ukraine grapples with what re- 
search infrastructure to rebuild—and 
where—after the war, says theoretical phys- 
icist Yulia Bezvershenko, a visiting scholar 
at Stanford University. “We should build 
something they don’t have” in Europe. 

She and others hope peace will open 
the way to an overhaul of Ukraine’s R&D 
enterprise. The government’s own draft re- 
covery plan for science and education calls 
for setting up a genomics research center 
and a Ukrainian version of the U.S. De- 
fense Advanced Research Projects Agency, 
which funds high-risk research. The plan 
also calls for rebuilding Ukraine’s small 
oceanographic research fleet. 

Another objective is expanding merit- 
based funding. The main source of com- 
petitive grants is the nascent National 
Research Foundation of Ukraine (NRFU), 
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which in 2020 allotted $9 million to 216 
projects. NRFU’s $25 million budget for 
this year was sequestered for the military. 
But in 2023 it plans to spend $12 million 
on 57 projects related to security and sus- 
tainable development, says NRFU Execu- 
tive Director Olga Polotska. 

Gogotsi gives NRFU high marks for re- 
cruiting expats and other scientists outside 
Ukraine as reviewers. He’s less flattering 
about other parts of Ukraine’s research 
establishment—especially NASU, which 
he says is largely unchanged since the So- 
viet era. Any infusion of postwar funding 
for Ukrainian science, he argues, should 
be tied to integrating NASU’s many insti- 


Russian troops looted and smashed instruments in a 
radio telescope’s central building. 


tutes into the university system and get- 
ting them to compete for funding. He also 
wants to see them become “greenhouses” 
for tech startups. 

Although NASU’s Zagorodny recognizes 
the need for reform, he says he wants to 
“maintain our historical traditions.” He re- 
jects the idea of subordinating NASU insti- 
tutes to universities. As a better model, he 
points to Germany’s Max Planck Society’s 
world-class institutes, which thrive inde- 
pendently alongside the country’s univer- 
sity system. 

Like many in Ukraine, Shulga is not 
waiting for peace before planning a re- 
vival. In October, at an IAEA conference in 
Washington, D.C., he met with Grossi, who 
he says supports KIPT’s aim to establish 
an International Center on Nuclear Phys- 
ics and Medicine with the Neutron Source 
facility as its centerpiece. 

Bezvershenko, however, says it’s risky 
to build major scientific infrastructure 
near the Russian border. Chudnovsky has 
even floated the idea of relocating KIPT, if 
only temporarily, to western Ukraine. That 
move would echo the Soviet response to 
the Nazi invasion of Ukraine during World 
War II, when it evacuated Kharkiv’s sci- 
ence establishment to eastern Russia. 

But Shulga has no plans to move, not least 
because it would be impossible to uproot the 
Neutron Source and other large KIPT facili- 
ties. More importantly, such a retreat would 
undermine the ambitions of Ukrainian sci- 
ence leaders to restore the luster of Kharkiv, 
the nation’s hero city and its scientific jewel, 
Zagorodny says. “That’s how we will honor 
its courage and heroism.” 


Richard Stone's Ukraine reporting was supported by 
the Richard Lounsbery Foundation. 
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OUT OF THE ASHES 


Slavutych was built after the nuclear meltdown at Chornobyl. 
The town, scarred by war, is planning a research revival 


n one of the last acts of camaraderie in a 
fracturing nation, Soviet laborers from 
the Baltics to the Caucasus converged 
on a pine forest in the late 1980s to 
build a Ukrainian town from scratch. 
Slavutych was a new home for work- 
ers from the Chornobyl Nuclear Power 
Plant and their families after its 1986 
explosion turned Pripyat, the town 
next to the plant, into a radioactive waste- 
land. In March, Slavutych’s 20,000 residents 
endured a different breed of terror, as Rus- 
sian troops massed outside the town. 
Heroic dashes around enemy lines to se- 
cure supplies—and a potent display of soli- 
darity in the town’s central square—staved 
off disaster. Now, Slavutych, like the rest 
of Ukraine, is girding for a grueling win- 
ter of electricity outages. But Mayor Yuri 
Fomichev and Anatolii Nosovskyi, a radia- 
tion warrior with decades of experience 
at Chornobyl, are already planning Slavu- 
tych’s postwar rebirth as a science center, 
one that would focus on a formidable chal- 
lenge: dismantling the radioactive remains 
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By Richard Stone, 
in Slavutych, Ukraine 


of Chornobyl’s destroyed Unit 4 reactor. 
Lessons learned at Chornobyl—the mother 
of all decommissioning projects—can be ap- 
plied globally, at dozens of nuclear plants 
that are slated to shut down in the coming 
years, says Nosovskyi, director of Ukraine’s 
Institute for Safety Problems of Nuclear 
Power Plants (ISPNPP) in Kyiv. “Our idea 
is to bring together specialists from across 
Ukraine in Slavutych. It’s what the town 
was always meant to be—a research cluster.” 
Creating a science center “in an envi- 
ronment that’s unique in the world would 
very much make sense,” says Kai Vetter, a 
nuclear physicist at University of California, 
Berkeley, whose team has donated instru- 
ments and supplies to ISPNPP since the 
war started. “It’s a fantastic idea,” adds Nick 
Tomkinson, a nonproliferation expert at 
Global Nuclear Security Partners, a London 
consulting firm that hopes to map radio- 
active contamination around Chornobyl. 


On 24 February, Russian troops swarmed 
across the border with Belarus and seized 
the Chornobyl plant. Nosovskyi, who began 
his career working on nuclear submarine 
radiation safety, worried the invaders would 
reawaken the radioactive nightmare he wit- 
nessed when he was dispatched to Chornobyl 
in 1987. There, he monitored the radiation 
received by tens of thousands of scientists 
and soldiers as they erected a concrete shel- 
ter over Unit 4’s seething remains. Over the 
years, he worked hand in hand with Russian 
scientists—cooperation that ceased after Rus- 
sia’s illegal annexation of Crimea in 2014. “We 
figured out how to manage the Chornobyl 
problem on our own,” he says. 

In April, after Russian soldiers retreated 
from the region, Nosovskyi ventured back 
to a satellite ISPNPP facility in Chornobyl. 
Smashed instruments and glass shards car- 
peted its chemistry lab’s floor. Looters had 
made off with a dozen vehicles and the fa- 
cility’s newest computers. Scores more com- 
puters had been stripped of hard drives. 
“T gave up smoking 4 years ago. Took it up 
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again after seeing what happened to the 
lab,’ Nosovskyi says, lighting up a cigarette. 
(As he talked to Science at an outdoor café 
on 10 October, three Russian cruise missiles 
whizzed by overhead on their way to targets 
in western Ukraine.) 

Nosovskyi believes Russia raided the 
Chornobyl] lab for evidence that Ukraine was 
working to secretly develop nuclear weap- 
ons, as three Russian newspapers falsely al- 
leged in late February. “Of course, they made 
that up,” he says, pointing out 
that Ukraine’s nuclear facilities 
are monitored by the Interna- 
tional Atomic Energy Agency. 

Russia’s military also took 
plant workers and guards hos- 
tage. A chilling moment came 
when two officers demanded 
access to one of Chornobyl’s 
two spent fuel repositories, 
which harbor ferociously radio- 
active material. The crew chief 
rebuffed them, says Valeriy 


“It’s what 
the town was 
always 
meant to be- 
aresearch 
cluster.” 


Anatolii Nosovskyi, 


Acontrol room in Chornoby!’s Unit 
reactor has been empty since 1986, 
but the reactor’s remains still seethe. 
- le 


headache to transport staff from Slavutych 
to Chornobyl for decommissioning tasks 
at the plant’s four shuttered reactors and 
maintaining the spent fuel repositories. 
Before the war, workers could make the 
50-kilometer commute by train through Be- 
larus. With that border closed, staff endure 
a 340-kilometer, 6-hour bus ride around Be- 
larus, and work 8-day shifts requiring them 
to bunk in Chornobyl. 

Slavutych, meanwhile, was devastated 
economically. In the months be- 
fore the invasion, the town had 
become a base camp for foreign 
tourists inspired to visit by the 
HBO miniseries Chernobyl. The 
war crushed that cottage indus- 
try. In late February, a bridge on 
the only road into Slavutych was 
blown up, and a Russian battal- 
ion camped outside the town. As 
supplies diminished, volunteers 
undertook perilous drives on log- 
ging roads—dodging shells and 


Seyda, Chornobyl’s acting di- Institute for Safety Russian patrols—to reach vil- 
rector, and the suspicious Problems of lages outside the blockade and 
pair left without incident. NuclearPowerPlants spirit back staples such as milk, 

But the occupiers _ stole potatoes, and flour. “That al- 


equipment from the plant and tracked in 
radioactive contamination after digging 
trenches and laying mines in the nearby Red 
Forest—named for pines killed by a plume 
of radiation from the 1986 accident. Since 
the soldiers’ retreat, the plant complex has 
been cleaned up and parts of Chornobyl vil- 
lage have been demined, but it remains a 
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lowed us to survive,’ Fomichev says. 

In early March, natural gas supplies to 
Slavutych were interrupted, shutting down 
the town’s communal heating plant. Engi- 
neers rigged a boiler to run on firewood, 
which managed to supply just enough 
heat to keep apartment blocks from freez- 
ing. Then Russian troops sabotaged the 


main powerline to Slavutych. The lights 
went out—as did electric stoves. Residents 
resorted to cooking over open fires on the 
streets in frigid temperatures. 

In the meantime, Slavutych organized a 
militia: about 200 volunteers, a handful of 
whom had military training. Equipped only 
with Kalashnikov rifles, they engaged Rus- 
sian tanks and artillery on the town’s out- 
skirts. “We put up a brave fight,” Fomichev 
says. Five died in skirmishes, and badly out- 
gunned, the militia on 25 March agreed to a 
Russian demand to leave the city. But when 
a Russian convoy entered the town square, it 
encountered a throng of unarmed protesters 
chanting “Slavutych is Ukrainian, go home!” 
“The commander had no idea what to do. 
Apparently, he didn’t have the stomach to 
slaughter unarmed civilians,’ Fomichev says. 
Two days later, the battalion withdrew. 

The town’s struggles are not over. Over the 
past several weeks, Russian bombardments 
have left Slavutych without electricity most 
hours of the day, and town officials worry 
about further disruptions to the supply of 
natural gas. 

But Fomichev and Nosovskyi are already 
proselytizing for their vision of Slavutych as a 
nuclear science hub specializing in radiation 
medicine, radioecology, and the monumental 
task of decommissioning the Unit 4 reactor, 
an effort expected to last at least 40 years. 
Research is sorely needed in such areas as 
radiation-hardened robotics and the proper- 
ties of irradiated graphite from the destroyed 
reactor’s core. “We really don’t know how to 
safely handle such material,” Nosovskyi says. 
“There are enormous opportunities to de- 
velop and demonstrate advanced technolo- 
gies,” Vetter says. 

Nosovskyi dreams of eventually installing 
a small modular reactor, built in a factory 
elsewhere and shipped to Slavutych to gen- 
erate power and to use as a training facility. 
He envisions launching a training program 
on reactor decommissioning with the help of 
the Igor Sikorsky Kyiv Polytechnic Institute, 
which already has a branch in the town. With 
the train link down for the foreseeable future, 
Fomichev is exploring a ferry service on the 
Dnieper River that would speed the commute 
between Slavutych and Chornobyl. 

Realizing the vision would require gov- 
ernment or international funding, a pos- 
sibility only after the war ends, Nosovskyi 
acknowledges. But the war has strength- 
ened his resolve. He mentions the grand- 
son his daughter gave birth to in Kyiv in 
March while holed up in an underground 
shelter. “The little boy doesn’t know yet 
how brave he was. How brave his mother 
was. But she named him Lev,’ the Ukrai- 
nian word for lion. “So he will know, and 
remember, someday.” & 
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AAAS/SUBARU SB&F PRIZES FOR EXCELLENCE IN SCIENCE BOOKS 


New books for young scientists 


A charming science-inspired spell book, an overdue ode to overlooked 
scientists, a vivid exploration of the circle of life, and a dazzling tome 
packed with images of the cosmos are featured among the finalists for 
the 2023 AAAS/Subaru SB&F Prize for Excellence in Science Books, a 
collection of awards sponsored by Subaru of America, Inc., in partner- 
ship with the American Association for the Advancement of Science 
(AAAS, the publisher of Science). Read on for reviews of the finalists— 
which take on topics ranging from the fungal to the forensic—written 
by Science staff with the help of a few friends. —Valerie Thompson 


CHILDREN’S SCIENCE PICTURE BOOK 


Fox 


Reviewed by Sacha Vignieri* 


Understanding death is a challenge for 
most, but perhaps especially so for young 
children. Yet sometimes even they must grap- 
ple with this phase of the life cycle. In Fox: 

A Circle of Life Story, Isabel Thomas tells a 


story about the life and death of a mother fox. 


The book begins by playfully describing the 
mother fox’s life as she hunts and cares for 
her kits as they grow, play, and learn. Midway 
through, the fox meets an untimely end when 
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she is hit by a car after a successful hunt. Her 
body, readers learn, provides food for scaven- 
gers and decomposers, and eventually all that 
was once the fox is recycled into new life. At 
the end of the book, Thomas provides a more 
detailed description of the decomposition and 
recycling process, describing, in an accessible 
way, the science behind the book’s narra- 
tive. The story, warmly illustrated by Daniel 
Egnéus, not only teaches the biology of death, 
decomposition, and renewal but may provide 
solace to children affected by death or dying. 
And yet, while the story is sweet and the 
illustrations adorably depict the fox and 
her family engaging in typical fox behav- 
iors in the book’s first half, the mother’s 


death feels jarring, especially because 
much has been made of fox family’s bonds 
before this event. A child might wonder, 
“What will happen to the fox kits? How 
will they survive?” The book’s illustra- 
tions suggest that they do survive, but 

in nature, they probably would not have 
made it through the winter alone. For 
most children, this book’s theme might be 
too upsetting, but for those who have lost 
someone they love, it may help them to 
better understand the ways death makes 
way for new life. 


Fox: A Circle of Life Story, /sabe! Thomas, 
Illustrated by Daniel Egnéus, 
Bloomsbury Children’s Books, 2021, 48 pp. 


Good Eating 


Reviewed by Jennifer Sills? 


In Good Eating: The Short Life of Krill, 
author Matt Lilley challenges readers to look 
at the world through the eyes of a krill, a 
tiny crustacean that serves as a food source 
for many marine animals. “You” (the reader, 
embodying the krill’s experience) sink down 
into the dark ocean as an egg, hatch, and 
swim back to the surface, molting repeatedly. 
Lilley describes each stage of the krill’s life 
cycle in simple language. When the egg first 
hatches, he explains, “You are a six-armed 
oval.” Later: “You grow more spines.” Just 
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before the krill reaches the surface, he quips, 
“You grow a face...sort of. About time!” 

As the krill matures, the reader discovers 
its special skills: Your transparent stomach 
can turn green! You can light up! Now at the 
water’s surface, you learn that you are not 
alone: There might be a “million million” 
creatures just like you—perhaps “a krillion”? 
(The appendix, which provides background 
about the Antarctic krill, clarifies that a mil- 
lion million is, in fact, a trillion.) 

Dan Tavis illustrates the krill’s journey 
with clean ink lines and watercolor shad- 
ing. The krill’s red hues stand out against 
a backdrop of cool blues and greens. The 
images reinforce the reader’s role by posi- 
tioning predators as a krill might see them. 
The hungry faces of penguins, slightly 
distorted as if viewed from underwater, 
peer down from an opening in an ice sheet. 
The mouth of a blue whale fills a subse- 
quent two-page spread. The book’s main 
character pops with personality, thanks 
to its googly eyes and entertaining facial 
expressions. Although the text plainly de- 
scribes the krill’s most important role—eat- 
ing single-celled organisms until becoming 
food itself—there is no need to worry. The 
little krill around whom the story revolves 
escapes its predators and lives to continue 
its adventure...this time. 


Good Eating: The Short Life of Krill, 
Matt Lilley, Illustrated by Dan Tavis, Tilbury House 
Publishers, 2022, 36 pp. 
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HeroRat! 


Reviewed by Jeffrey Mervis? 


HeroRat! tells the story of Magawa, an 
African giant pouched rat who has been 
trained to sniff out buried land mines. The 
book’s 32 pages are packed with colorful and 
action-filled drawings. 

There is also plenty of content. HeroRat! is 
stuffed with details about Magawa’s training, 
including the three reasons that pouched 
rats “make better heroes” than dogs (they 
cost less to keep and train, they can easily 
work with different handlers, and they do not 
weigh enough to trigger mines). The main 
character has a can-do personality bordering 
on braggadocio. “I thought it was going to be 
hard to become a hero, but I’m really good at 
searching for those awful explosives,’ Magawa 
says about his training by a Tanzanian non- 
profit before heading off to Cambodia, which 
we learn ranks second to Afghanistan as the 
country with the most land mines. There, he 
performs so well that he garners a gold medal 
“for animal gallantry,’ billed as the first ever 
awarded to a rat. 

The book’s somber message—that animals 
can help the planet recover from the ravages 
of war—is probably too complicated, if not 
downright scary, for little children. But hav- 
ing Magawa deliver a first-person account of 
his training renders it a bit too childish for 
older students. So the book’s target audi- 
ence is not obvious. Asked what he liked best 


about HeroRat!, my 9-year-old grandson re- 
plied, “You normally think of rats as bad. But 
this book shows how they can help people.” 
Adults may want to consider adding this 
simple but important message as they share 
Magawa’s story with young listeners. 


HeroRat!: Magawa, a Lifesaving Rodent, 
Jodie Parachini, Illustrated by Keiron Ward and 
Jason Dewhirst, Albert Whitman & Company, 2022, 32 pp. 


Tu Youyou’s Discovery 


Reviewed by Caroline Ash* 


In 2015, Tu Youyou became the first Chi- 
nese woman to win a Nobel Prize, sharing 
the Prize in Physiology or Medicine for 

her role in identifying the ingredient of an 
herbal remedy that is now the world’s most 
important antimalarial drug. Tu Youyou’s 
Discovery, by Songju Ma Daemicke, tells 
the scientist’s inspiring story. 

At a time when most girls in China were 
not formally educated, Youyou’s parents 
sent her to school, but at 15 years old, she 
contracted tuberculosis and was bedrid- 
den for several months. With the help of 
antibiotics and her mother’s traditional 
remedies, Youyou made a full recovery over 
the next two years. This experience became 
her inspiration, and she persisted with her 
education to become a research scientist in 
1955, applying modern scientific methods to 
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test traditional medicines as potential treat- 
ments for drug-resistant malaria. 

With clear text and vivid illustrations that 
will appeal to a range of age groups, the 
book describes how Youyou, whose work 
overlapped with China’s Cultural Revolu- 
tion, overcame numerous challenges, from 
limited resources to doubts sewn by her 
male colleagues. It also reveals how an an- 
cient text inspired the simple method that 
would ultimately be used to extract arte- 
misinin, or qginghaosu—the compound that 
would prove effective against malaria—from 
sweet wormwood. By the time Youyou stood 
on the podium in Stockholm to receive her 
prize, artemisinin had saved the lives of 6.8 
million children. 

The book may be slim, but it does an 
excellent job of conveying a sense of the 
time, sacrifice, and the sheer persistence 
that often underlie revolutionary scientific 
discoveries. It also succeeds in showing that 
the scientific method can be applied to folk 
knowledge to great effect and in celebrating 
an otherwise unsung life in science. 


Tu Youyou’s Discovery: Finding a Cure for Malaria, 
Songju Ma Daemicke, Illustrated by Lin, Albert Whitman & 
Company, 2021, 32 pp. 


MIDDLE GRADES SCIENCE BOOK 


It Takes Guts 


Reviewed by Marc S. Lavine> 


The journey our food takes from the time it 
enters our bodies to when it exits is long and 
complicated, and it involves many muscles, 
tissues, and organs. Capturing the details of 
each step along the way could be an arduous 
task, but Jennifer Gardy manages to describe 
the processes of human digestion in a lively 
way in It Takes Guts. 

Each chapter of the book covers a part of 
the digestive tract, from the mouth to the 
stomach to the intestines, and describes the 
different stages of digestion that occur in 
each one. Readers learn how two sphinc- 
ters in the esophagus control the passage 
of food downward while preventing the 
upward passage of stomach acid, all under 
the automated control of our enteric nervous 
system. Sword swallowers train themselves to 
actively control these sphincters to create an 
almost straight tube from mouth to stomach, 
a skill exploited by the 19th-century German 
physician Adolf Kussmaul, who attempted to 
map out the esophagus of one such individual 
using a long tube lined with mirrors. 

Further along, readers learn about mucus 
and enzymes, gastrin and ghrelin, and 
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villi and crypts, all of which play roles in 
regulating eating and digestion. Less directly 
involved, but just as important, are other or- 
gans that participate in the digestive process. 
The liver produces bile acids, the pancreas 
produces insulin and glucagon, and even the 
appendix plays a part—it is now believed to 
act as a storage chamber for beneficial bacte- 
ria. Speaking of bacteria, many are essential 
to proper digestion, as some produce enzymes 
that humans need but cannot synthesize. 
Gardy also includes a discussion of normal 
and not-so-normal things everyone experi- 
ences from time to time—such as burping, 
farting, and more. Being able to distinguish 
healthy from unhealthy functions is often 
key to diagnosing ailments and disease, so 
this material may be particularly valuable 
to the reader. Overall, Jt Takes Guts is an 
interesting, entertaining, and informative 
book to bite into. 


It Takes Guts: How Your Body Turns Food into Fuel 
(and Poop), Jennifer Gardy, Illustrated by Belle Wuthrich, 
Greystone Kids, 2021, 152 pp. 


Snoozefest 


Reviewed by Seth Thomas Scanlon® 


Fewer and fewer adolescents are getting 
enough shut-eye (1, 2). Snoozefest is a delight- 
ful and informative new book on the science 
of sleep that may just convince budding 
young night owls to catch a few more z’s 
every evening. 

Author Tanya Lloyd Kyi delves into the 
history of sleep research with bite-sized but 
detailed summaries of important experimen- 
tal milestones, such as the development of 
electroencephalography (EEG), the character- 
ization of melatonin, and the discovery of cir- 
cadian rhythms. She also highlights the latest 
findings on why we sleep and dream, noting, 
for example, how it helps with consolidation 
of memories, formation of insights, recovery 
from trauma, integration of emotions, and 


repair of the brain’s daily wear and tear. 

The book describes how sleep can be 
beneficial for the proper functioning of the 
metabolic, cardiovascular, and immune sys- 
tems, while acknowledging that many major 
questions remain unanswered. Along the way, 
Kyi touches on some important public policy 
issues such as the dangers of sleeping pills, the 
problems associated with night-shift work, and 
the need for school systems to better accom- 
modate the “phase-delayed” sleeping patterns 
of most teens. The author even provides some 
fascinating examples of how various other 
species have managed to balance the risks of 
predation with the need for sleep. 

A few tangential topics, such as Freudian 
dream analysis, are a distraction, and the 
attention paid to alternative “treatments” for 
insomnia (e.g., acupuncture) is, in my view, 
problematic. Instead, the author might have 
devoted more space to subjects such as how 
animals dream and sleep. The drawings by 
Valéry Goulet are fun and inviting but mostly 
ornamental—the relative dearth of scien- 
tifically informative illustrations feels like a 
missed opportunity. 

Still, this book is certainly not the “snooze- 
fest” suggested by the title. Rather, it is a 
very engaging and informative volume on an 
enormously important area of research that 
should rouse even the most bleary-eyed teen 
from their slumber. 


Snoozefest: The Surprising Science of Sleep, 
Tanya Lloyd Kyi, Illustrated by Valéry Goulet, 
Kids Can Press, 2021, 80 pp. 


REFERENCES AND NOTES 


1. J.M. Twenge, Z. Krizan, G. Hisler, Sleep Med. 39, 47 (2017). 
2. S.Paruthietal.,J. Clin. Sleep Med. 12,1549 (2016). 


Stolen Science 


Reviewed by Yevgeniya Nusinovich’ 


When children learn about science, they 
are often given the impression that most 
major discoveries were made by white men. 
However, Stolen Science reveals that there 
is often more to the stories that perpetu- 
ate this idea, and for every Marie Curie 
who was given credit for her work, there 
are many women and others throughout 
history whose scientific contributions went 
unacknowledged. 

In this book, Ella Schwartz presents the 
stories of 13 scientists and inventors who 
contributed much to science and in many 
cases are still forgotten. A pattern emerges 
across each story: A woman, an enslaved 
person, or another member of a marginalized 
group makes a major discovery or invention, 
but a white man or men get all the credit. In 
some cases, the men in question intention- 
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ally undermined the person who made the 
discovery, as happened to crystallographer 
Rosalind Franklin, whose crucial work on 
the structure of DNA was minimized by her 
competitors. In other cases, the individual’s 
contribution was recognized only after it 
was later confirmed by someone else, as 
happened to Carlos Juan Finlay, a Cuban 
physician who identified how yellow fever 
is transmitted. The world gave credit to US 
army physician Walter Reed, even though 
Reed himself readily acknowledged that 
Finlay had made the discovery. 

Each story in this book is supported by a 
separate section of text explaining the science 
behind the work in question using terms 
that middle-school-aged or even younger 
children should be able to follow. In addition, 
each of the featured individuals is beautifully 
portrayed by illustrator Gaby D’Alessandro, 
who combines images of the scientists and 
inventors with symbols of their work. 
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This is a very important book that not 
only gives credit where it is long overdue 
but should also help inspire future scientists 
from all backgrounds. Readers will be left 
wondering how much better the world would 
be if everyone was allowed to develop their 
talents to the fullest. 


Stolen Science: Thirteen Untold Stories of 
Scientists and Inventors Almost Written out of 
History, Ella Schwartz, Illustrated by Gaby D Alessandro, 
Bloomsbury Children's Books, 2021, 128 pp. 


How to Build a Human 


Reviewed by Sacha Vignieri* 


We humans attribute our domination of 
Earth to our big brains, which have al- 

lowed us to adapt to, and convert, land- 
scapes all over the planet. However, our 


oversized cortex is a relatively recent de- 
velopment. In the introduction of her book 
How to Build a Human, Pamela Turner 
emphasizes that just 3 million years ago, an 
extraterrestrial looking for the animal with 
the largest brain would have selected an 
ancestor of modern dolphins. 

In the pages that follow, Turner takes 
the reader step by step through human 
evolution, from an ancient ape ancestor 
that lived around 20 million years ago to 
today. Our understanding of this lineage 
has greatly expanded over the past few 
decades, and she cleverly describes how 
humanity’s evolutionary tree is full of many 
branches, each distinctive but in many ways 
similar. The journey takes place in seven 
steps, beginning with “We stand up,” which 
describes the evolution of bipedal locomo- 
tion, an innovation that likely first emerged 
in African apes around 7 million years ago. 
It concludes with “We become storytellers,” 
which summarizes how early Homo species 
developed art, a likely expression of more 
complex language skills. 

Engaging illustrations by John Gurche, a 
respected paleoartist, accompany Turner’s 
story. His artwork conveys the myriad 
species of hominoid in ways that clearly 
display their “humanity,” rendering them in 
ways that allow readers to recognize these 
species as our relatives. The book is also 
full of photographs of fossils, art, and tools 
that further bring human ancestors and 
relatives to life. Readers will leave this book 
with an understanding of how we came to 
be who we are and with an appreciation 
for the fact that, for much of our history, 
we lived alongside other species descended 
from the same ancient ancestor. 


How to Build a Human: In Seven Evolutionary 
Steps, Pamela S. Turner, Illustrated by John Gurche, 
Charlesbridge, 2022, 176 pp. 


HANDS-ON SCIENCE BOOK 


Forensics for Kids 


Reviewed by Michael A. Funk® 


As we move through the world, we leave 
traces of ourselves that can lead back to 
us. When a violent crime occurs, police 
can turn to field and laboratory analysis to 
determine what has happened at the crime 
scene and whether a suspect was present. 
In Forensics for Kids, Melissa Ross traces 
how forensic science emerged in Britain 
and the United States in the 1800s as 
policing practices mixed with rudimentary 
medicine and chemistry. This combina- 
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tion has been refined over the past two 
centuries, with new sources of evidence, 
analytical methods, and standards required 
for achieving a guilty verdict. 

Forensics for Kids consists of briefly 
described anecdotes and overviews of spe- 
cific cases, many of which will be familiar 
from detective novels and news stories. 
The scientific explanations of how various 
forensic techniques are or were applied 
are short, but the book does include many 
activities that children can do to simulate 
the process of collecting and analyzing 
evidence. For example, readers can try 
fingerprint matching, identifying a mystery 
kitchen chemical, or creating a database of 
shoe prints. Many of these activities can be 
done in a group and might inspire games or 
creative adaptations. 

Although detective work is important to 
solving crimes, the book does not examine 
how systemic biases or unreliable, unscien- 
tific forensic methods might distort criminal 
trials and lead to wrongful convictions. Blood 
splatter and bite mark analyses, for example, 
are presented with no mention that the ac- 
curacy of these methods has been vigorously 
questioned by the scientific community. 

In one section, the book describes 
automated DNA testing and aggregated 
DNA databases as tools to “keep dangerous 
criminals from being released to the streets,” 
without discussion of the privacy and due 
process concerns these methods can pose. 
Although this context may seem unnecessary, 
it is critical for readers to understand that 
such methods have limits and that errors can 
result in serious consequences for defendants. 


Forensics for Kids: The Science and History of 
Crime Solving, With 21 Activities, Melissa Ross, 
Chicago Review Press, 2022, 144 pp. 


Physics for Kids 


Reviewed by Pamela J. Hines® 


With Physics for Kids, Liz Lee Heinecke 
adds to her series of books dedicated to 
hands-on, at-home science experiments. 
Each chapter of this new installment begins 
with a full-page color portrait of a scientist 
whose discoveries underlie the chapter’s ex- 
periments. The individuals chosen represent 
the diversity that enriches scientific discov- 
ery. A short biography introduces the reader 
to the featured scientist and describes how 
a particular discovery they made is applied 
in the real world. The book then provides 
instructions for hands-on experiments that 
explore topics such as relativity, thin films, 
and sunspots, to name just a few. 

Readers learn about Italian physicist 
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Laura Bassi, born in 1711, who studied 
atmospheric electricity at her country 
house when experiments on lightning were 
banned in the nearby city of Bologna. Hei- 
necke suggests exploring Bassi’s fascination 
with static electricity by watching how an 
electrically charged balloon can chase soap 
bubbles on a plate. Another researcher, 
American engineer Christine Darden, born 
in 1942, worked at NASA doing calcula- 
tions for the first space missions and went 
on to study aircraft wing design. Heinecke 
encourages readers to follow in Darden’s 
footsteps by folding paper airplanes to see 
how wing shape affects aerodynamics. 

The experiments in this book are well 
chosen to suit students in the upper grades 
of elementary school or in middle school, and 
the supplies needed are simple and readily 
available. Safety tips are established early on, 
keeping the focus on carefully exploring the 
natural world. For each set of experiments, 
which are illustrated by photographs along- 
side the how-to protocol, a short paragraph 
explains the physics that drives the phenome- 


non under consideration. For those who want 
to dig deeper, the author suggests potential 
modifications. This would be a handy book to 
have around for any youngster interested in 
the physics of the natural world. 


Physics for Kids: Science Experiments and 
Activities Inspired by Awesome Physicists, Past 
and Present, Liz Lee Heinecke, Illustrated by Kelly Anne 
Dalton, Quarry Books, 2022, 128 pp. 


The Science 
Spell Book 


Reviewed by Valerie Thompson” 


As alchemists and astrologers were sup- 
planted by chemists and astronomers, 
many mysterious elements of the natural 
world yielded to scientific explanations. 
But this does not mean that there is no lon- 
ger a place for magic in science. “Creativity 
and awe are integral traits of any success- 
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ful scientist, and they are rooted in the 
imaginative exercises of childhood,” notes 
Cara Florance in The Science Spell Book, a 
compendium of mystical experiments for 
the science-minded spell-caster. 

The collection’s 25 activities are divided 
into five categories: “infusions,” exploring 
plants as pigments and indicators; “illumina- 
tion,” probing the nature of light; “sorcery,” 
interrogating electromagnetism and other 
fundamental forces; “alchemy,” describing 
physical and chemical changes; and “mim- 
icry,” looking to plants and animals for engi- 
neering inspiration. Moody photos featuring 
minerals, dried herbs, and candles illustrate 
the activity’s end products—a violet solution 
that changes color when breathed upon; 
earthy pigments derived from coffee, flower 
petals, and clay; insect wings and acorns 
coated in borax crystals. 

Fanciful titles and charming incantations 
infuse each activity with magical touches, 
but science lurks just below the surface. 
“Current flows through the coiled course. 
Domains align to exert their force,” reads 
the opening of “Magnetic Manifestation,’ for 
example, which guides readers through the 
steps required to create an electromagnet. 
A short summary of the principle at play 
comes next, followed by a list of supplies, 
steps, and a short section that elaborates on 
the science underlying the activity. Readers 
are likely to have some of the required items 
on hand—rubbing alcohol, cream of tartar, 
and turmeric, for example. Others are more 
unusual—think butterfly pea flower tea, 
hobby motors, LEDs—and might need to be 
purchased in advance. 

“Learning the scientific reasoning 
behind a mysterious event does not take 
away from the magic but instead exposes 
more exciting mysteries and questions to 
be answered,” observes Florance. Readers 
need look no further than this delightful 
book for proof that this is so. 


The Science Spell Book: Magical Experiments for 
Kids, Cara Florance, Sourcebooks eXplore, 2022, 224 pp. 


Funky Fungi 


Reviewed by Michael A. Funk® 


Whether a strange color sticking out of dull 
brown leaves, a drooping mane of tendrils 
attached to a tree, or maybe just an odd funk 
in the air after a rainy day, fungi are amazing 
and diverse and a bit strange. In Funky Fun- 
gi, Alisha Gabriel and Sue Heavenrich convey 
an infectious love for finding fungi in the wild 
and explain how these mysterious organisms 
fit into our ecosystems and lives. 


SCIENCE science.org 


The book is packed with interesting and 
up-to-date facts about this often-misunder- 
stood kingdom of life. Mushrooms—both 
those that are edible and those that should 
not be eaten—are well represented but are 
just a small sliver of the diversity discussed. 
Yeasts, molds, insect-zombifying Cordyceps, 
and even a honey mushroom that may be the 
largest organism on Earth are all on the figu- 
rative menu. Speaking of eating, don’t expect 
tips for distinguishing poisonous mushrooms 
from safe—this book is not a foraging guide 
and doesn’t try to be. Although there is a sec- 
tion on fungi as food, Gabriel and Heavenrich 
stick to what readers can find in a supermar- 
ket. Other ways to appreciate fungi include 
growing a mold garden, dyeing a piece of fab- 
ric with pigments extracted from mushrooms, 
and starting a compost heap. 

In addition to fungus facts, the book in- 
cludes profiles of mycologists, beautiful high- 
resolution photos, and lots of activities, both 
indoor and outdoor. Although some of the 
terminology and concepts in the text might 
challenge younger readers, the denser sec- 
tions are also packed with engaging examples 
and will hopefully inspire more reading and 
adventuring. There are a few online resources 
mentioned near the end and suggestions 
for field guides if readers want to take their 
forays to the next level. 


Funky Fungi: 30 Activities for Exploring Molds, 
Mushrooms, Lichens, and More, Alisha Gabriel and Sue 
Heavenrich, Chicago Review Press, 2022, 128 pp. 


YOUNG ADULT SCIENCE BOOK 


The Complete 
Guide to Absolutely 
Everything (Abridged) 


With wry, irreverent British humor, Adam 
Rutherford and Hannah Fry tackle a host 
of big, albeit seemingly random, ques- 
tions, devoting a chapter of The Complete 
Guide to Absolutely Everything (Abridged) 
to each. These range from the age-old and 
deep (Do we have free will? What would 
aliens look like?) to the droll (Does my dog 
love me? Are we all suckers?). Most are not 
definitively answered, but the quest for 
solutions is engaging and enjoyable, pep- 
pered with pop culture, literary, and classi- 
cal references and easy-to-digest analogies. 
Rutherford and Fry focus less on objec- 
tive scientific truths and more on us. For 
example, readers journey through human 


timekeeping, from sundials to “leap seconds,” 
and learn how the modern financial system 
hinges precariously on subterranean fiber- 
optic cables that deliver atomic time to banks. 

Instead of Einstein’s time dilation, expect 
to learn about time dilation inside the human 
body (time seems to slow when we are in 
peril or cut off from sunlight). And Darwin’s 
theory of evolution takes a backseat to his 
debunked theories of human emotion. 

Despite mostly glossing over technical de- 
tails, The Complete Guide does highlight some 
fascinating recent scientific findings—the fact 
that rats appear to feel regret, for example, 
and a case in which a woman appeared to be 
able to smell Parkinson’s disease on a patient’s 
clothing months before they began exhibiting 
symptoms—as well as a number of state-of- 
the-art experiments. 

Rutherford and Fry also shine a light on 
important issues that plague modern science, 
such as the “File Drawer Problem,’ wherein 
“shiny novelties” are prioritized for publica- 
tion over essential but less flashy experiments 
that verify existing results, and the related 
“replication crisis.” 

Readers are likely to finish The Complete 
Guide with a pocketful of intriguing anecdotes 
with which to entertain at cocktail parties and 
a feeling of endearment toward humankind, 
who dare to ask questions they are “singularly 
ill-equipped to answer.” 


For a full-length review of The Complete Guide to Absolutely 
Everything (Abridged), see “Confronting age-old questions, 
big and small,” Science 375, 35 (2022). 


The Complete Guide to Absolutely Everything 
(Abridged): Adventures in Math and Science, Adam 
Rutherford and Hannah Fry, W. W. Norton, 2022, 304 pp. 


Sticky 


Reviewed by Marc S. Lavine® 


In Sticky, Laurie Winkless takes readers 

on a tour of what makes some things stick 
together and others slide past one other, 
describing how a range of factors can change 
these interactions. The book includes an 
exploration of materials such as paints, per- 
manent and temporary adhesives, nonstick 
coatings, tires, gecko feet, and shark skin. For 
each, the factors that make them sticky or 
slippery involve chemical and physical attrac- 
tions or repulsions. In fact, as Winkless notes, 
neither “sticky” nor “slippery” are technical 
terms, and there is no simple numerical scale 
to delineate between the two states that we 
often think of as opposites. 

Winkless takes a very broad view of 
surfaces and interfaces, such that the book 
includes detours to related topics that are of- 
ten needed to understand the science under 
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consideration. For example, to understand 
how air currents interact with the surface of 
golf balls and airplane wings, the reader re- 
quires a crash course in aerodynamics. Like- 
wise, Winkless’s discussion of the factors 
that affect the speed of high-performance 
race cars goes well beyond the engineering 
of the car’s tires or brakes, describing how 
specific track and race day conditions affect 
the chemistry and design of many of the 
car’s components. 

We do not often think of slipperiness in the 
context of earthquakes, observes Winkless, 
however, it is the stick-slip motion of tectonic 
plates that leads to cracks in the earth and 
the movement of faults. Meanwhile, in 
discussing the surface properties of ice, we 
learn about a raging scientific debate in the 
otherwise sedate sport of curling, with two 
competing theories of the underlying factors 
that determine the curl of a stone. 

Through a wide range of topics, including 
some that are likely to be less well known, 
Sticky offers readers an insider’s guide to the 
secret science of surfaces. 


Sticky: The Secret Science of Surfaces, 
Laurie Winkless, Bloomsbury Sigma, 2022, 336 pp. 


Astroquizzical 


Reviewed by Adrian Cho” 


Ever wonder why stars twinkle, what will 
happen to Earth when the Sun dies, or how 
we know that the Universe is expanding? 
This lavishly illustrated, fact-packed book an- 
swers these questions and more in a way that 
is both accessible and conceptually meaty. 
The reader comes away with a solid introduc- 
tion to astronomy, planetary science, stellar 
and galactic astrophysics, and cosmology. 

The reader takes a tour of the cosmos 
guided by Jillian Scudder, an astrophysicist 
at Oberlin College and Conservatory, who 
since 2013 has fielded questions on her blog 
of the same title. Moving ever outward, she 
opens with the night sky, takes a close look 
at the Moon, and swings through the plan- 
ets in our solar system. She examines the 
Sun and other stars, zooms out to the galax- 
ies, and finally contemplates the Universe as 
a whole. 

Among the questions, Scudder sprinkles 
10 thought experiments, musings on some 
hypothetical scenario, such as, “What 


would happen if we split the Sun in half?” 
Some of these digressions are pretty quirky. 
Scudder asks, for example, what would 
happen if you could open a door on Earth’s 
surface and step out on the Moon? The 
answer: nothing good, thanks to the 1480 
kilometer per hour wind that would spit 
you through the door. 

The real stars of the show are the cos- 
mos’s own treasures. The book features doz- 
ens of jaw-dropping photos of marvels such 
as Olympus Mons, the 24-kilometer-tall vol- 
cano on Mars, and the fiery face of the Sun 
speckled with coronal mass ejections. The 
famous photo of countless galaxies taken by 
the Hubble Space Telescope may well bring 
the viewer to an awestruck stop. Thanks to 
Scudder’s accessible prose and the luscious 
illustrations, readers will come away with 
many answers, but likely even more ques- 
tions to fuel their curiosity. 


Astroquizzical: Solving the Cosmic Puzzles of 
our Planets, Stars, and Galaxies (The Illustrated 
Edition), Jillian Scudder, |con Books, 2021, 224 pp. 


The Last Days of the 
Dinosaurs 


Reviewed by Victoria Arbour? 


Sixty-six million years ago, the beginning of 
the end of the age of dinosaurs came crash- 
ing down in the form of an asteroid that 
struck what is now the Yucatan Peninsula 
in Mexico. The details of this cataclysmic 
event, and what happened afterward, have 


rarely received the feature-length treat- 
ment that Riley Black presents in The Last 
Days of the Dinosaurs. 

In just over 300 pages, Black pulls 
together decades of scientific research 
on the demise of the nonavian dinosaurs 
into a deeply compelling narrative of both 
luck and misfortune in the face of almost 
unimaginable calamity. Starting just a few 
days before the asteroid impact, Black 
centers the story on the animals present in 
the Hell Creek Formation of the western 
United States, a geological unit that has 
been studied for more than 100 years and 
that provides the best window we have into 
the time before and after the mass extinc- 
tion. Through the eyes of the inhabitants of 
Hell Creek, readers pass through the mo- 
ment of impact and then the first hour, day, 
month, and year of the Cenozoic era. 

The story continues with chapters set 
one hundred, one thousand, one hundred 
thousand, and one million years after the 
asteroid impact, charting the reshaping of 
the world. Each chapter ends with a detour 
to somewhere else on the planet—Ant- 
arctica, India, and the Atlantic Ocean, for 
example—providing a global balance to this 
otherwise tightly focused narrative. 

Unlike the other mass extinctions re- 
corded in the fossil record, extinction for 
most species at the end of the Cretaceous 
probably happened within a few hours or 
days after the asteroid impact. Black’s writ- 
ing captures the horror of the impact event 
without lingering on doom and gloom, 
instead keeping a steady momentum focus- 
ing on survival and change. 

Black recounts stories of lucky survi- 
vors whose descendants will be familiar 
to many readers—frogs, snakes, turtles, 
and crocodiles—and organisms that rarely 
get a mention in dinosaur stories, such as 
the planktonic coccoliths and coil-shelled 
ammonites. And of course, as the nonavian 
dinosaurs pass into history, we see the 
changes that happen to the ancestors of 
today’s mammals, who evolved into new 
forms in the asteroid’s aftermath. 


For a full-length review of The Last Days of the 
Dinosaurs, see “Inside the dinosaurs’ demise,” 
Science 376, 360 (2022). 


The Last Days of the Dinosaurs: An Asteroid, 
Extinction, and the Beginning of Our World, 
Riley Black, St. Martin's Press, 2022, 304 pp. 


10.1126/science.adf7725 
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Psychedelic-assisted psychotherapy is one model of psychedelic medicine that could be used to treat some neuropsychiatric disorders. 


The therapeutic potential of psychedelics 


The development of psychedelics as medicines faces several challenges 


By Emmanuelle A. D. Schindler*? and 
Deepak Cyril D’Souza** 


sychedelics are reported to have rapid- 

onset and long-lasting therapeutic 

benefits after a single or few doses. 

Sustained (1 year or more) clinical 

benefits have been reported in depres- 

sion and smoking cessation studies 
after just two or three doses of psilocybin 
combined with psychotherapy (J, 2). By con- 
trast, conventional medications for neuropsy- 
chiatric disorders take days to weeks to begin 
working and need to be taken daily over pro- 
longed periods and sometimes indefinitely. 
Other potential applications for psychedelics 
include treatment of cancer-related anxiety, 
obsessive-compulsive disorder, headache 
disorders, and phantom limb syndrome. 
Although arguably paradigm shifting, a num- 
ber of unanswered questions remain about 
psychedelics as medicines, including the defi- 
nition of a psychedelic drug, the mechanism 
of therapeutic effects, optimizing clinical 
benefit, and verifying safety. 
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Lysergic acid diethylamide (LSD), psilocy- 
bin, N,N-dimethyltryptamine (DMT), 2,5-di- 
methoxy-4-iodoamphetamine (DOI), and 
mescaline are some of the agents collectively 
categorized as classic psychedelics. These 
drugs are all agonists at 5-hydroxytrypta- 
mine (5-HT; serotonin) 2A (5-HT,,) receptors 
and produce characteristic acute psychedelic 
effects, which include alterations in percep- 
tion, feeling, and consciousness (3). Although 
other compounds produce psychedelic-like 
effects as well as clinical benefits [such as 
ketamine and 3,4-methylenedioxymetham- 
phetamine (MDMA; “ecstasy”)], they are 
pharmacologically distinct, and so the focus 
here is on classic psychedelics. 

Narrowly defining a drug class by one set 
of its effects (psychedelic) can be problematic 
because it colors the perception and may ul- 
timately limit the breadth of its application. 
For example, patients may be hesitant to 
take an antidepressant for a nonpsychiatric 
condition, such as peripheral neuropathy or 
migraine, simply because of the class name. 
Alternate terms offered for psychedelics 
include “psychoplastogens” or “neuroplas- 
togens” (4), which remove prejudice and 
highlight the ability of these drugs to induce 
change, although not necessarily the distinct 
dosing regimen. Borrowing from headache 
medicine, transitional medications are those 
taken for a short time and that suppress 


headache for a prolonged period well beyond 
the treatment itself (steroid pulse). A com- 
pound term such as “transitional neuroplas- 
togen” captures the notions of long-lasting 
change after a brief treatment period. 

The mechanism of therapeutic effects of 
psychedelics is widely queried but remains 
unclear. 5-HT,, receptor antagonists block 
acute psychedelic effects, but whether they 
also block therapeutic effects requires further 
investigation. To what extent the many other 
direct or indirect targets of psychedelics— 
such as 5-HT,,, 5-HT,,, and 5-HT,,, receptors; 
dopamine receptors; a-adrenergic receptors; 
monoamine transmission; and glutamater- 
gic transmission—contribute to therapeutic 
effects is also not known (3). Once bound to 
a receptor, a ligand may also activate one or 
more intracellular processes. For example, 
the B-arrestin signaling pathway has been 
suggested to be relevant for antidepressant 
effects of 5-HT,, receptor activation but not 
psychedelic effects (5). Psychedelics also have 
numerous physiological effects, including 
anti-inflammatory, hormonal, and epigenetic 
effects, which have pathological relevance 
in such conditions as depression, substance 
abuse, and headache disorders (6). 

How any of these transient effects on re- 
ceptors or biological systems might explain 
sustained therapeutic effects is unknown. 
The initiation of a cascade of events with en- 
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during neuroplastic effects at the cellular and 
network level is one plausible and popular 
theory. Classic psychedelics have been shown 
in cellular and in vivo preclinical models to 
promote synaptogenesis and increase corti- 
cal dendritic spine size, number, and com- 
plexity (7), with some effects lasting a month 
(8). In pigs, a single intravenous dose of psilo- 
cybin was shown to induce lasting (7 days) in- 
creases in cortical and hippocampal synaptic 
density (9). Interestingly, experiments identi- 
fied some, but not other, cellular changes to 
be 5-HT,, receptor-mediated. Whether these 
neuroplastic cellular changes are related to 
durable therapeutic effects could be investi- 
gated in models of human disease or human 
patients. For example, in patients with treat- 
ment-resistant depression, changes in brain 
resting-state functional connectivity the day 
after completing a two-dose regimen of psi- 
locybin correlated with a lasting clinical im- 
provement at 5 weeks (0). Moreover, various 
changes in brain connectivity persisted for 
1 month after a single dose of psilocybin, as 
did increases in positive mood and decreases 
in anxiety (11). These postpsychedelic con- 
nectivity changes suggest an association with 
and perhaps a source for therapeutic effects. 
However, replication of findings in placebo- 
controlled studies, over a longer term, and 
with clearly delineated modeling and analyti- 
cal parameters is necessary to show this more 
conclusively. In addition, comparisons with 
other drugs that also induce neuroplastic 
changes, such as ketamine (8), are necessary 
to characterize signature effects of psychedel- 
ics. Studies in different patient populations 
will also be required to identify the changes 
relevant to specific disorders, such as hypo- 
thalamic function in cluster headache (6). 
The neuroplastic effects of psychedelics 
may serve to open a therapeutic window, 
allowing other drugs or treatments to take 
effect. For example, psychedelic drug stud- 
ies in depression include a course of psycho- 
therapy, a standard treatment used in depres- 
sion. In a case of phantom limb syndrome, 
psilocybin mushrooms were reported to have 
synergistic and lasting therapeutic effects 
when used in conjunction with mirror ther- 
apy, a standard rehabilitative therapy used to 
reverse aberrant somatosensory cortex reor- 
ganization in that condition (12). These psy- 
chedelic-assisted therapies use the drug with 
an existing disease-specific therapy. However, 
psychedelics may also have independent 
treatment effects. Cluster headache patients 
have been self-administering them as stand- 
alone treatment for decades, and clinical 
trials in headache disorders have modeled 
this method. Ultimately, the independent 
and interactive effects of psychedelics with 
other disease-specific therapies must be in- 
vestigated systematically. The type and du- 
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ration of adjunctive treatment must also be 
considered (which form of psychotherapy for 
depression). The need to repeat drug treat- 
ment at certain intervals is also anticipated, 
although existing clinical trials are relatively 
short. Furthermore, some study protocols in- 
clude curated decoration, music, and so on 
during drug dosing, as well as the presence 
of therapists to guide or enhance the expe- 
rience. The specific settings and interactions 
that are necessary and optimal for therapeu- 
tic outcomes have not been systematically 
examined. Notably, whether additional treat- 
ments and procedures can be implemented 
on a large scale and reach all populations in 
need must be factored into the development 
of psychedelics as medicines. 

Positive correlations between the magni- 
tude of the psychedelic experience and thera- 
peutic benefit have not been consistently 
observed. In headache disorders, acute psy- 
chedelic effects appear unrelated to thera- 
peutic outcomes (13). Across studies, the 
scales (and subscales) used to measure psy- 
chedelic effects are not aligned, leaving the 
relationship between acute subjective effects 
and clinical effects unclear. Furthermore, dif- 
ferent psychedelics produce distinct acute ex- 
periences. Seeking to understand the origin 


“The neuroplastic effects of 
psychedelics may serve to open 
a therapeutic window...” 


of specific acute perceptual and other sub- 
jective effects and their relevance in treating 
particular neuropsychiatric conditions could 
further optimize treatment. This could be 
done through a number of complementary 
experimental manipulations: comparing the 
therapeutic efficacy of subpsychedelic versus 
psychedelic doses, conducting wide dose- 
response studies, blocking the psychedelic 
effects with targeted receptor antagonists, 
using analogs that lack psychedelic effects 
[such as 2-bromo-LSD (BOL)], comparing 
classic psychedelics or using other psycho- 
tropic drugs with distinct pharmacological 
profiles (such as MDMA), or administering 
drugs to individuals while they are in natural 
or induced sleep. If the acute psychedelic ef- 
fects of these drugs are central to some thera- 
peutic effects, it will be critical to determine 
what level and duration are necessary. For 
example, intravenous DMT showed next-day 
(rapid) antidepressant effects and produced 
acute psychedelic effects for ~30 minutes 
(74), contrasting oral psilocybin’s 6 hours or 
LSD’s 12 hours of acute effects. A shorter psy- 
chedelic experience, if resulting in the same 
clinical benefit, would be more logistically 
feasible and palatable. 


Unlike the development of drugs in the 
pharmaceutical industry, there is a massive 
amount of information about psychedelics 
available to the public before their imple- 
mentation as medicines. Media coverage 
is not bound by the standards of accuracy 
of scientific reporting and seems biased to- 
ward covering the “universally life-changing” 
abilities of psychedelics. This raises expecta- 
tions for success in clinical trials, which may 
only partially be tempered by education (J5). 
Study results are also affected by potential 
unblinding from the unmistakable acute 
effects of psychedelics and lack of acute ef- 
fects with placebo (75). An active control 
that produces acute subjective effects may 
minimize unblinding, although identifying 
such an agent is challenging. A low dose of 
the psychedelic being investigated may be 
used, but this too may produce lasting thera- 
peutic effects (73). A related drug with over- 
lap in several dimensions of the psychedelic 
experience, such as ketamine, may be tried, 
although ketamine also has lasting clinical ef- 
fects. The use of other nonclassic psychedel- 
ics with no known therapeutic effects (such 
as salvinorin A) might be considered. Other 
control agents that have been used include 
niacin and diphenhydramine, although these 
drugs do not entirely substitute for acute ef- 
fects, particularly of higher doses of psyche- 
delics. Additional methods for maintaining 
blinded conditions include recruiting psy- 
chedelic-naive subjects, emphasizing inter- 
personal variability in the acute effects, and 
incomplete disclosure about drugs or doses 
that may be received (15). 

Given the increase in psychedelic re- 
search, together with the surge in popu- 
lar and commercial interests, the safety of 
psychedelics must not only be revisited but 
considered in the context of current and fu- 
ture use. Historically, the use of psychedelics 
has involved the infrequent consumption of 
moderate to high doses. In research, limited 
dosing (single or a few doses) is studied, and 
the drug is administered under controlled 
conditions with medical and psychiatric 
oversight to carefully screen and prepare 
participants. Such practices support safety 
and tolerability and deter misuse. However, 
practices that deviate from this model are 
emerging. One version of the practice of “mi- 
crodosing” involves repeated exposure to low 
or subperceptual doses of a psychedelic over 
a prolonged period. Although “micro” might 
sound appealing and denote safety, there is 
no evidence that the frequent and long-term 
use of psychedelics (at any dose) is safe. As a 
case in point, the LSD-derivative methyser- 
gide, an effective migraine and cluster head- 
ache preventive (taken daily), was removed 
from the market after cases of cardiac valve 
fibrosis and other tissue fibrosis emerged. 
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The fibrogenic effects are related to 5-HT,,, 
receptor activation, and although psychedel- 
ics have varying affinities for this receptor 
chighest for ergot derivatives), frequency and 
duration of exposure must be considered in 
the pharmacodynamics of these new, unveri- 
fied regimens. 

Psychedelics may also have acute thera- 
peutic effects (for example, aborting a head- 
ache attack). Although potentially acceptable 
for conditions that require infrequent use, 
the frequent consumption of these drugs 
for the acute management of a chronic and/ 
or persisting condition is not only impracti- 
cal but risks tolerance and loss of efficacy 
and has not been systematically studied 
for safety. Indeed, an ongoing challenge 
within pain management is the reliance on 
abortive rather than preventive treatment, 
which leads to sensitization and dependency. 
Psychedelics have historically failed to dem- 
onstrate addictive properties, but the neu- 
ropsychological impact of frequent (and po- 
tentially increasing) use needs further study. 
In addition to pharmacology and purpose of 
use, other factors that contribute to how a 
drug is used (or misused) include availability, 
perception, commercialization, and promo- 
tion. Furthermore, the idea that psychedelics 
may be used outside of a diagnosed medical 
condition—say, for general life enhancement 
or improved concentration—is intriguing 
but will also require formal investigation. 
Without dedicated study, new regimens 
and applications may have unexpected out- 
comes. The comprehensive investigation of 
psychedelics and their implementation as 
legitimate medicines remain valuable but 
substantial undertakings. & 
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Research using herbarium specimens collected over nearly 200 years demonstrates how the native North American 
weed waterhemp (shown here invading a soybean field) adapted to agricultural practices over space and time. 


AGRICULTURE 


Herbaria reveal cost 
of the Green Revolution 


Rapid weed evolution is exposed by genome 
sequencing of natural history collections 


By Katherine Waselkov! and Kenneth M. Olsen? 


visitor to the US Midwest will be im- 
mediately struck by the sheer scale of 
industrially farmed corn and soybean. 
These fields are intensively managed 
artificial ecosystems, from their plant- 
ing and harvesting timelines to the 
fertilizers and pesticides that are continually 
applied. Evolutionary biologists have long 
presumed that weeds are under strong natu- 
ral selection to adapt to this anthropogenic 
ecosystem, which first appeared in the mid- 
20th century’s “Green Revolution” in agricul- 
ture. On page 1079 of this issue, Kreiner et 
al. (1) report that the selection pressure on 
weeds in modern agricultural fields is higher 
than estimates from most other natural sys- 
tems (2). The authors leverage historical 
samples in natural history collections to tem- 
porally link the adaptation of the agricultural 
weed waterhemp to the Green Revolution. 
The intensification of agriculture in the US 
and Canada has resulted in higher crop yields 
on less acreage in exchange for greater use 
of nitrogen-based fertilizers and pesticides. 
However, the maintenance of crop monocul- 
tures in these biodiversity-poor ecosystems 
spurred an arms race against weeds, insect 
pests, and microbial pathogens. To date, 267 
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plant species have evolved resistance to at 
least one chemical group of herbicides (a type 
of pesticide) meant to control their presence 
in agricultural fields (3). This intense, hu- 
man-mediated selection pressure increased 
in the late 1990s with the widespread adop- 
tion of soy, cotton, and corn varieties that 
were genetically modified to resist glypho- 
sate pesticides. This chemical (commercially 
known as Roundup) is currently the most 
popular herbicide in the US (4). Early fears 
about the escape of modified genes from ge- 
netically modified crops have largely been 
allayed through careful design and regula- 
tion. However, the proliferation of weeds that 
have evolved resistance to glyphosate largely 
through naturally occurring mutations has 
been an unanticipated consequence of the 
commercialization of glyphosate-resistant 
crops (5). As the continuous, exclusive use 
of Roundup leads to the emergence of more 
glyphosate-resistant weed populations and 
species (currently 56), farmers are resorting 
to older chemicals or more expensive weed- 
control methods. One of the most pervasive 
and damaging glyphosate-resistant weed spe- 
cies is the North American native waterhemp 
(Amaranthus tuberculatus). 

Waterhemp is unusual among agricultural 
weeds in that individual plants are either 
male or female and thus must cross-pollinate 
to reproduce (unlike many weeds that self- 
pollinate). Encountering a mate is enabled 
by wind pollination and enormous popula- 
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tion sizes as well as a staggering reproductive 
capacity in optimal conditions (up to 1 mil- 
lion seeds per female plant) (6). Waterhemp’s 
native range is from the US Great Plains to 
southern Ontario, Canada, where its original 
habitat was the periodically flooded margins 
of bodies of water. Reconstructing the histori- 
cal invasion of wild waterhemp into agricul- 
tural fields is complicated by the existence of 
two taxonomic varieties within the species (7). 
The variety rudis in the western part of the 
species range (west of the Mississippi River) 
had preexisting characteristics that allowed it 
to colonize human-disturbed environments. 
Its human-assisted migration eastward into 
the range of the variety tuberculatus began 
in the mid-1800s, marking the origin of the 
agricultural weed (8). Waterhemp was not a 
major agricultural problem until the 1990s, 
when it began to evolve resistance to her- 
bicides. To date, various US populations of 
waterhemp have evolved resistance to seven 
different chemical classes of herbicides, and 
many populations show resistance to mul- 
tiple chemicals (3). 

The study of Kreiner e¢ al. is the culmina- 
tion of several previous studies on waterhemp 
evolution (9-11). The authors used whole-ge- 
nome sequence data to identify all genes with 
substantial differences between geographi- 
cally paired agricultural and natural popu- 
lations. The resulting candidate genes for 
agricultural adaptation will provide a valu- 
able starting point for future studies aimed 
at identifying and functionally confirming 
genes and mutations involved in weedy traits 
(including herbicide resistance and other as- 
yet-unknown traits for agricultural adapta- 
tion). Unexpectedly, their estimates of very 
strong selection on three common herbicide- 
resistance genes are in the opposite direction 
predicted by most previous literature. The 
fate of a resistance mutation in a population 
is determined in part by its fitness. Mutations 
that suffer little or no fitness cost are more 
likely to persist in the absence of herbicide. 
The fitness costs of being herbicide resistant 
in natural environments appear to be greater 
than the benefits of being herbicide resistant 
in agricultural environments. The fitness 
costs of carrying an herbicide-resistance mu- 
tation in the absence of herbicides depend on 
the particular mutation, the other genes pres- 
ent in the genome, and the other selection 
pressures in the environment. The estimates 
of Kreiner et al. are the first to be based on 
empirical measurements in a nonexperimen- 
tal context (72). 

The most interesting aspect of the study 
of Kreiner et al. is the detection of tempo- 
ral changes in agriculture-associated gene 
frequencies and the measurement of selec- 
tion over time. These inferences were made 
possible by DNA extraction and whole-ge- 
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nome sequencing from herbarium specimen 
samples, including those dating back to the 
mid-19th century. Many evolutionary stud- 
ies have used DNA from museum specimens, 
including research on weeds, invasive spe- 
cies, and herbicide resistance (13, 14), but the 
scale and quality of the genetic data analyzed 
by Kreiner et al. have set the standard for 
future research. The authors followed best 
practices by extracting DNA in a laboratory 
designed to handle ancient DNA. Although 
herbarium-derived data about plant popu- 
lations must be interpreted with caution— 
given the necessarily limited sample size per 
time period and the known biases in plant 
collection (J3)—short-read whole-genome 
sequencing technology has made it pos- 
sible to analyze fragmented DNA, such as 
the degraded genetic material from older 
specimens. The judicious use of destructive 
sampling of museum specimens will play an 
ever-larger role in the study of evolutionary 
biology. Unfortunately, museums are under- 
funded and collection efforts are decreasing 
at a time when having whole specimens for 
future unknown uses is more important than 
ever given the rate of biodiversity loss (13). 
Kreiner et al. show that waterhemp 
adapted to agricultural intensification 
through migration and selection of preexist- 
ing genetic variation, followed by the rapid 
evolution of herbicide resistance through 
new mutations. As industrial-scale agricul- 
ture becomes more mechanized and technol- 
ogy driven (75), new selection pressures will 
spur the rise of new adaptations and weed 
species in crop fields. Future evolutionary bi- 
ologists will rely on continued collection of 
herbarium specimens, most likely analyzed 
with currently unavailable technology, to 
understand the ongoing arms race between 
weeds and agricultural pest management. 
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DRUG DEVELOPMENT 


Getting 
bifunctional 
molecules 
into cells 


A class of transmembrane 
proteins helps shuttle 
large drugs across the 

cell membrane 


By R. Scott Lokey! and Cameron Pye? 


imultaneously optimizing biochemical 

potency and membrane permeability 

remains one of the biggest challenges 

in drug development. For typical drug 

targets, such as enzymes and recep- 

tors, finding a configuration of atoms 
that will allow target binding with high af- 
finity can be relatively easy to achieve using 
drug-like molecules, which are generally be- 
low ~600 Da. However, there is a long list of 
therapeutically compelling targets inside the 
cell that either lack the kinds of deep pockets 
appropriate for binding drug-like small mol- 
ecules or have multiple pockets separated 
over large distances. For molecules that 
are large enough to effectively engage chal- 
lenging protein interfaces, getting past the 
cell membrane can be insurmountable. On 
page 1097 of this issue, Lou et al. (1) describe 
a mechanism by which large, dumbbell- 
shaped (bitopic) molecules with molecular 
weights as high as 1784 Da can access the 
cytoplasm, revealing a potential avenue to 
reach intractable intracellular targets. 

Most small molecules cross membranes 
by diffusion (2). This passive mechanism of 
membrane transport requires striking a bal- 
ance between membrane permeability and 
other competing factors. If a molecule is too 
hydrophilic, it cannot partition into the mem- 
brane, whereas if it is too hydrophobic, it will 
tend to aggregate in aqueous compartments 
or get stuck in the membrane. Although 
navigating these constraints with drug-like 
small molecules is commonplace, the rules 
that govern passive permeability become less 
clear when molecules become larger (3, 4). 
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Large macrocyclic peptides (MCPs) from 
natural products serve as counterexamples 
to the long-held assumption that the passive 
mechanism is only available to low-molecu- 
lar weight drugs. Cyclosporine A (CSA), an 
immunosuppressive drug with a molecular 
weight of >1200 Da, maintains passive cell 
permeability and oral bioavailability despite 
its size (5). Early examples of fully synthetic 
MCPs are making their way into the clinic (6), 
and more will follow as the tools and rules to 
enable the design of MCPs with passive per- 
meability are refined. However, for a growing 
number of large, bitopic molecules, like those 
described by Lou et al., their cellular poten- 
cies are difficult to reconcile with often very 
low passive permeabilities. Such 
cases suggest the possibility of 
more active modes of transport 
through cell membranes, raising 
the question as to which pro- 
teins, if any, are involved. 

Lou et al. take a functional 
genomics approach. They used 
a genome-wide CRISPR screen 
in patient-derived chronic my- 
eloid leukemia cells to identify 
proteins whose expression is re- 


with asciminib, an allosteric inhibitor that 
binds to the myristoyl pocket, which is 
distant from the active site. They also pro- 
duced a covalently cross-linked dimer of 
the natural product rocaglamide, which 
binds the eukaryotic initiation factor 4A-I 
(EIF4A1) helicase and inhibits translation. 
Not only were DasatiLink-1 and BisRoc-1 
active in cells, but, as with RapaLink-1, the 
potency of both drugs was significantly en- 
hanced by IFITM expression. Although the 
cellular potency of the rocaglamide dimer 
(BisRoc-1) was similar to that of the unlinked 
molecule—which was also the case for the in- 
dividual component drugs that comprise Ra- 
paLink-1 and DasatiLink-l1—reducing IFITM 


Gaining entry to cells 
Membrane permeability is a major challenge in drug development. Biologics, 
such as antibodies, can only target extracellular protein domains and soluble 
targets. Macrocyclic peptides (MCPs) from natural products can passively 

diffuse across the cell membrane. Small molecules can be designed to passively 
traverse the cell membrane, but their size limits the region they can bind in 

target proteins. Bitopic molecules can engage much larger regions of intracellular 
proteins, broadening potential targets, and their entry into cells is at least 
partially mediated by interferon-induced transmembrane proteins (IFITMs). 


potent bitopic molecules (8). The extent to 
which IFITM proteins are required for the 
membrane transport of such molecules, and 
the structural requirements for IFITM-me- 
diated transport, will be exciting areas for 
further study. 

Although it is inspiring to see an ad- 
ditional mechanism of cell entry start to 
be elucidated, relying on IFITM-mediated 
transport in the therapeutic context could 
introduce complexities and challenges. The 
expression of IFITM proteins is likely to vary 
between patients and even among cell types 
within the same patient. Furthermore, for 
anticancer compounds that require IFITMs 
for therapeutic efficacy, down-regulation of 
IFITMs could emerge as a mech- 
anism of acquired resistance. 

There is clearly a rich land- 
scape of bioactivities in large, 
multifunctional molecules. Nat- 
ural products have shown the 
promise of breaking the rules of 
traditional small-molecule me- 
dicinal chemistry, but the chal- 
lenges associated with achieving 
their biochemical complexities 
while balancing drug-like prop- 


quired for the cellular effect of Biologics Small molecules and MCPs Bitopic molecules erties have long placed limits 
RapaLink-1, a bitopic inhibitor >150 kDa <~1500 Da < ~1800 Da on efforts to mimic their success 
of mammalian target of rapamy- Q0 9° , with purely synthetic molecules. 
cin (mTOR) composed of the gore ee ni The excitement around targeted 
allosteric binder rapamycin co- 1 No intracellular Passive madiated protein degraders [proteolysis- 
valently attached to the kinase access diffusion entry targeting chimeras (PROTACs)] 


active site inhibitor sapanisertib 

by way of a long, flexible linker. 

Despite its high molecular 

weight of nearly 1800 Da, Ra- 

paLink-1 shows unexpected cel- 

lular efficacy and is even able 

to traverse the blood-brain barrier, which 
is more limiting than the gut wall or the 
membranes of isolated cells. Their screen 
revealed a strong chemical-genetic relation- 
ship between the cellular efficacy of Rapa- 
Link-1 and the expression of interferon-in- 
duced transmembrane proteins (IFITMs). 
Furthermore, the sensitivities of hundreds 
of cell lines to RapaLink-1 were highly cor- 
related to IFITM expression. Combined with 
the hypothesized role of IFITMs in blocking 
viral entry by modulating membrane prop- 
erties at the virus-host cell interface (7), 
these observations suggest a role for IFITM 
proteins in mediating bitopic drug cell entry 
(see the figure). 

The authors also investigated the effect of 
IFITM expression on the potency of other 
bitopic molecules unrelated to RapaLink-1. 
Lou et al. generated an inhibitor of the on- 
cogenic kinase breakpoint cluster region 
(BCR)-ABL1 called DasatiLink-1, in which 
they combined dasatinib, a well-established 
clinical inhibitor of the kinase active site, 
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Plasma membrane 


Target 
proteins 


expression diminished the efficacy of the 
dimer but not rocaglamide alone. 

The work by Lou e¢ al. provides a glimpse 
into a potential mechanism by which large, 
bitopic molecules can enter cells, but the 
true importance of IFITM proteins in medi- 
ating the cellular effects of these molecules 
awaits further study. For all three of the bio- 
active bitopic molecules described, the dif- 
ferences in potency when the three JFITM 
family members were deleted compared 
with wild-type cells were modest: 6-fold 
for RapaLink-1, 2.5-fold for DasatiLink-1, 
and 2.2-fold for BisRoc-1. This raises ques- 
tions about the residual activity of these 
molecules in IFITM-deleted cells: Are there 
other, protein-mediated mechanisms by 
which these molecules get into cells, or is 
the effect mediated by IFITMs indirect? Per- 
haps another CRISPR screen using /FITM- 
deleted cells could reveal additional play- 
ers. It is also possible that these molecules 
exhibit a limited degree of passive perme- 
ability, as has been shown with other highly 


and other induced proximity 
i} strategies over the past de- 
- cades has at least partially been 
driven by the modular, design- 
able nature of these molecules, 
whereby two distinct and un- 
related binders can be linked together to 
achieve an entirely new biological effect. 
Criticisms of these molecules often focus on 
their atypical molecular structures and the 
accompanying poor biophysical properties, 
but Lou et al. pave the way for further un- 
derstanding the complexities of cell entry 
and bioactivity of this rapidly evolving class 
of therapeutic molecules. 
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COMPUTER SCIENCE 


AlphaCode and “data-driven” programming 


Is ignoring everything that is known about code the best way to write programs? 


By J. Zico Kolter'? 


ompetitive programming problems 
represent a challenging task for even 
skilled programmers: Given a short 
natural language description of an al- 
gorithmic problem, contestants must 
quickly write a program that solves 
the task. On page 1092 of this issue, Li et 
al. (1) present the AlphaCode system, which 
represents a substantial step forward in the 
development of machine learning (ML) mod- 
els that can synthesize computer programs 
to solve these types of challenging problems. 
But what is perhaps most surprising about 
the system is what AlphaCode does not do: 
AlphaCode contains no explicit built-in 
knowledge about the structure of computer 
code. Instead, AlphaCode relies on a purely 
“data-driven” approach to writing code, 
learning the structure of computer programs 
by simply observing lots of existing code. 
AlphaCode was built by using a type of 
ML model called a “large language model” 
(LLM), which operates by predicting text 
one “token” at a time (tokens are collections 
of a small sequence of characters, but they 
can be thought of as single characters in the 
code). AlphaCode predicts the first character 
in the solution code (given the problem de- 
scription), then the second character (given 
the description and first character), the third 
character (given the description and first and 
second characters), and so on. Recent work 
has highlighted the potential for LLMs to 
generate code—for example, evaluations of 
the GPT-3 system (2) for code creation and 
the release of GitHub’s Copilot using the 
Codex system (3)—but AlphaCode’s ability 
to generate entire programs that solve com- 
petitive programming problems is a notable 
advance. Although AlphaCode contains some 
architectural differences from this past work 
(for example, AlphaCode does not need to 
generate problem descriptions, so it uses a 
slightly different mechanism to encode the 
problem text), the underlying system oper- 
ates in a manner similar to previous LLMs. 
What fundamentally makes AlphaCode 
outperform other systems on the competitive 
programming task boils down to two main 
attributes: training data and postprocessing 
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of candidate solutions. Arguably, the most 
important lesson for any ML system is that 
it should be trained on data that are similar 
to the data it will see at runtime. For this 
purpose, the AlphaCode developers built the 
CodeContests dataset, building on two ex- 
isting datasets (4, 5). CodeContests consists 
of the explanatory text of ~13,500 competi- 
tive programming problems, some simple 
test cases of desired input-output pairs, and 
many potential solutions for each problem 
across several programming languages. The 
system is also trained on unstructured code 
from GitHub (characters of code in many lan- 
guages, without a problem description). 
However, generating a single piece of solu- 
tion code for a problem performs poorly; writ- 
ing code token by token is prone to mistakes 
by writing invalid code or programs that pro- 
duce incorrect results. Thus, AlphaCode also 
generates thousands of candidate solutions 


“It may seem surprising that 
this procedure has any chance 
of creating correct code.” 


and filters them by ensuring that they are 
valid and pass simple test cases. The system 
also clusters similar solutions and submits a 
single example from each cluster to increase 
the diversity of the potential solutions. 

What is notably missing from the 
AlphaCode system is any architectural design 
in the ML model that relates to the task of 
interest: producing code. Computer code is 
a highly structured medium; programs must 
adhere to a defined syntax and must pro- 
duce well-defined pre- and postconditions 
within the different portions of the solution. 
Program synthesis has a long history (6) and 
numerous techniques have been developed 
for generating programs that obey these 
types of constraints. It seems only natural 
that given a medium as structured as com- 
puter code, this structure would be used by 
ML models aiming to write code. 

But AlphaCode does none of this. It gener- 
ates code the way LLMs generate any text— 
one token at a time—and only checks for pro- 
gram correctness after the entire program 
has been written. It may seem surprising that 
this procedure has any chance of creating 
correct code. But the reality is that given the 
proper data and model complexity, coherent 


structure can emerge. The ultimate recipe for 
this sequential generation procedure is bur- 
ied inscrutably within the parameters of the 
LLM. There are notable limitations to this 
general approach: Models can sometimes ef- 
fectively memorize small patterns within the 
dataset and then output such patterns in re- 
arranged fashion, leading some to term them 
“stochastic parrots” (7)—systems with no real 
understanding of the underlying question but 
that are able to blindly mimic likely outputs. 
Yet AlphaCode is not merely copying existing 
solutions: Li et al. examine and rule out this 
possibility. Ultimately, AlphaCode performs 
remarkably well on previously unseen coding 
challenges, regardless of the degree to which 
it truly “understands” the task. 

Yet it seems inherently disappointing if all 
the specialized knowledge about the struc- 
ture of computer programs was ultimately of 
limited practical value for LLMs trained on 
lots of data. Intuitively, understanding the 
structure of programs should be important 
to building ML models that can write them. 
And maybe this will eventually be the case; 
there is room left for improvement in the 
CodeContest benchmarks (AlphaCode solves 
~30% of the problems), and perhaps there 
will be future systems that leverage the power 
of AlphaCode combined with more classi- 
cal structured program synthesis to improve 
results. But the possibility should be taken 
seriously that this is not true, that data and 
scale are all that is needed, once referred to 
as “the bitter lesson” (8). Thus, in addition to 
being a compelling achievement, AlphaCode 
is perhaps best viewed as a baseline for the 
power of “raw” large-scale models to write 
code. If “hybrid” ML methods that combine 
data-driven learning with engineered knowl- 
edge can perform better on this task, let them 
try. AlphaCode cast the die. The datasets are 
public. Let us see what the future holds. 
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CLIMATE POLICY 


Fairness considerations in 
global mitigation investments 


Current mitigation finance flows are inadequate and unfair 


By Shonali Pachauri', Setu Pelz‘, Christoph 
Bertram’, Silvie Kreibiehl?, Narasimha D. 
Rao, Youba Sokona**, Keywan Riahi? 


espite overwhelming evidence that 
the world needs to make rapid 
and substantial investments in cli- 
mate mitigation in this decade to 
meet the ambitious goals of the 
Paris Agreement (J-3), political 
and financial barriers continue to hin- 
der mitigation efforts (2). Global mitiga- 
tion investment pathways modeled in the 
sixth assessment report (AR6) of the In- 
tergovernmental Panel on Climate Change 
(IPCC) reach global climate goals in a cost- 
effective manner. These are agnostic about 
who should finance these and how to fairly 
allocate costs and benefits of mitigation 
efforts. We apply equity considerations to 
global cost-effective mitigation investment 
needs and derive “fair-share” regional con- 
tributions, which describe the direction 
and magnitude of interregional financial 
flows that align with each consideration. 
We find that flows from North America 
and Europe to other regions would have to 
increase substantially relative to present 
levels to meet the Paris Agreement goals 
under most equity considerations. 
Progress on the alignment of financial 
flows with low greenhouse gas (GHG) 
emissions pathways remains slow (3). In 
2019 and 2020, annual global climate fi- 
nance flows were about USD (2015) 630 
billion (with more than 90% for mitiga- 
tion), but growth has slowed recently. The 
IPCC’s ARG stresses that these flows must 
increase globally by a factor between three 
and six to meet average annual needs until 
2030 to avoid the most dangerous impacts 
of climate change. Adequate capital and 
liquidity for this is globally available, as 
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is evident from the USD 2.4 trillion world 
energy investment in 2022 estimated by 
the International Energy Agency. The IPCC 
report also states that “accelerated finan- 
cial international cooperation is a critical 
enabler of a low-greenhouse gas and just 
transition.” In particular, adequate inter- 
national support for near-term investment 
is essential to ensure that national policies 
are put in place to attract the required fi- 
nance in this decade. 

Global mitigation investment needs in 
the IPCC’s AR6 are based on pathways 
generated by integrated assessment mod- 
els AMs). Several recent critiques of IAMs 
engage with the history, assumptions, limi- 
tations, and normative positioning of such 
exercises (4). Although several studies 
propose fair global carbon budget-shar- 
ing schemes, few focus on equity consid- 
erations in the financing of mitigation 


investments, and these largely disregard 
near-term investment flows (5). We build 
on literature that argues that cost-effective 
mitigation investments require recogni- 
tion of differentiated responsibilities, ca- 
pabilities, and needs to yield an equitable 
outcome and be realized (6). 

Consistent with suggestions in previous 
literature, we find that distributive justice 
considerations in global climate mitiga- 
tion will require substantial interregional 
finance flows (7). Although mitigation ac- 
tivities involve costs that are distinct from 
investments, our work focuses specifically 
on modeled estimates of regional mitiga- 
tion investment needs. This work provides 
a pathway to address the retrospective and 
prospective perspectives on climate equity 
in the literature: first, that wealthier high- 
emitting countries have historically ben- 
efited from fossil energy at the cost of poorer 
low-emitting countries (8), and second, that 
for cost-effective mitigation pathways to be 
fair, national and international redistribu- 
tive measures are likely necessary (9). The 
magnitude and direction of interregional 
flows that we derive can also serve as input 
for policy and climate negotiations in the 
short term to ratchet up mitigation ambi- 
tion, signal to the international private fi- 
nancial sector the magnitude of the required 
increase of interregional finance, and guide 
industrial pathways and value chain de- 
velopment toward a just and sustainable 
energy transition. 


Regional needs and fair-share contributions 

Annual regional cost-effective investment needs from 2020 to 2030 aligned with the well-below 2°C 

and 1.5°C compatible targets are shown (black, spanning the range from IPCC AR6). Recent regional 

average annual investments are shown for 2017 to 2020 (gray bars). Ranges of fair-share regional contributions 
to investment needs are shown using distinct considerations of equity (colors). 


1600 — IPCC AR6 cost-effective needs 
@ Recent regional average investments 2017-2020 

1400 Responsibility 

ry — RI: CO, emissions 1850-2019 

s 1200 — R2: CO, emissions 1990-2019 

3 Capability 

= 1000 — Cl: GDP per capita 2019 

ry — C2: Capital stock per capita 2019 

o 800 

x Needs 

a : Bas 

B 600 — NI: DLS deprivation 2016 

a — N2: Climate risk 2030 

a 

2 400 


SAP MEA APD AFR 


| 


| 


7 i i, l | Wi fy It I i ton 


SAS 


LAC EEA EAS EUR NAM 


Regions are defined as follows: SAP, Southeast Asia and developing Pacific; MEA, Middle East; APD, Asia-Pacific developed; AFR, Africa; SAS, 
southern Asia; LAC, Latin America and Caribbean; EEA, Eastern Europe and west-central Asia; EAS, Eastern Asia; EUR, Europe; NAM, North 
America. DLS, decent living standards; GDP, gross domestic product; IPCC AR6, Intergovernmental Panel on Climate Change sixth assessment 


report; PPP, purchasing power parity 


9 DECEMBER 2022 + VOL 378 ISSUE 6624 1057 


INSIGHTS | POLICY FORUM 


Gaps between needs and fair-share contributions 
Regional lower-bound cost-effective mitigation investment needs (dashed lines) and fair-share contributions under distinct considerations of equity (Rs, Cs, and Ns, 
where R is responsibility, C is capability, and N is needs) are shown as shares of total lower-bound global mitigation investment needs (see fig. S2 for upper-bound shares 
and fig. S3 for flows as share of regional gross domestic product). Some within-region needs can be met by fair-share contributions from countries in the region (@). 
Some regions’ fair-share contributions exceed their needs and can help meet needs in other regions (“). Some regions’ needs exceed their fair-share contributions, 
leaving gaps to be met by contributions from other regions (@). 
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Regions are defined as follows: SAP, Southeast Asia and developing Pacific; MEA, Middle East; APD, Asia-Pacific developed; AFR, Africa; SAS, southern Asia; LAC, Latin America and Caribbean; EEA, Eastern Europe and 
west-central Asia; EAS, Eastern Asia; EUR, Europe; NAM, North America. R1, CO, emissions 1850-2019; R2, CO, emissions 1990-2019; C1, gross domestic product per capita 2019; C2, capital stock per capita 2019; N1, 


decent living standards deprivation 2016; N2, climate risk 2030 


EQUITY CONSIDERATIONS AND INDICATORS 
We begin with the range of cost-effective 
regional mitigation investment needs in 
the decade 2020-2030 to achieve targets 
compatible with well below 2°C and 1.5°C 
warming, as provided in the IPCC AR6 (3). 
Regions are made up of countries and ter- 
ritories collected into broad geographical 
groups following the IPCC country grouping 
(see table S1). Regional mitigation invest- 
ment needs [defined in supplementary ma- 
terials (SM) section 2 and table S2], include 
low-carbon energy resource extraction, con- 
version, power generation, transmission, 
and storage, as well as economy-wide en- 
ergy efficiency improvements (3). 

Our subsequent consideration of eq- 
uity is consistent with principles emerg- 
ing in the climate equity literature that 
correspond to considerations of respon- 
sibility, capability, and needs, which in 
turn underlie the notion of “Common 
but Differentiated Responsibility and 
Respective Capabilities (CBDR-RC) in light 
of national circumstances” enshrined in 
the Paris Agreement (6). We apply these 
equity considerations to allocate “fair- 
share” regional contributions to annual, 
cost-effective, global climate mitigation in- 
vestment needs in the decade 2020-2030 
in proportion to the indicators we select 
for each (see tables S4 and S5). 

We draw on the literature in selecting 
established indicators for regional re- 
sponsibility and capability and introduce 
two new indicators that describe regional 
needs, which have not previously been 
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operationalized in the IAM literature. We 
measure responsibility (R) as historical 
cumulative emissions shares. Given diver- 
gent views on when countries should be 
held accountable for their emissions, we 
implement two periods, one from 1850, ac- 
counting for postindustrial contributions, 
and the other since 1990, the year of the 
first IPCC assessment report. We consider 
only emissions of the dominant long-lived 
climate forcer, carbon dioxide (CO,), from 
fossil fuel and industry, because other GHG 
emissions have not yet been thoroughly ex- 
plored in the climate equity literature. 

For capability (C), we use per capita 
gross domestic product (GDP) (C-1) and 
per capita capital stock (C-2), an alter- 
native wealth indicator that reflects the 
extent of physical fixed assets and infra- 
structure in an economy. Under the con- 
sideration of needs (N), we use the aver- 
age degree of deprivation across distinct 
dimensions of human well-being encom- 
passed by the decent living standards (10) 
(N-1) and the modeled share of regional 
population facing multisector climate risk 
in the year 2030 (17) (N-2). We propose 
these needs indicators because they are 
not composites of the others used (such as 
per capita GDP) and capture both retro- 
spective and prospective aspects of climate 
equity in terms of achieved human well-be- 
ing and future vulnerability to climate ef- 
fects. (See SM for a complete description of 
the methods used to transform each indi- 
cator into a corresponding regional fair- 
share contribution.) 


GLOBAL SCALE FINANCE FLOWS 

The IPCC AR6 reported large investment 
gaps between the recent average invest- 
ment levels (2017-2020) and cost-effective 
investment needs over the decade 2020- 
2030. For most regions, it reported that re- 
cent investments were about three to four 
times lower in magnitude than the cost- 
effective needs (see the figure). However, 
in some regions, the gaps are much wider. 
We find that these gaps shift dramatically 
when principles of equity are considered 
to derive fair-share contributions, requir- 
ing large interregional financial flows (see 
the figure). Our estimated range of inter- 
regional flows to meet fair-share regional 
contributions is between purchasing 
power parity (PPP) USD (2015) 248 billion 
and PPP USD (2015) 1581 billion annually 
during 2020-2030 (see fig. S1). 

The magnitude of interregional finan- 
cial flows required is lowest (i.e., closest 
to cost-effective needs) when considering 
responsibility for historical cumulative 
CO, emissions since 1990 (R-2) and high- 
est when considering the capabilities of 
regions (C-1 and C-2). With the exception 
of R-2, fair-share contributions under any 
equity consideration would be far higher 
than cost-effective needs in North America 
and Europe and lower in Africa, South and 
Southeast Asia, and Latin America. That is, 
regional cost-effective investment needs in 
low- and middle-income countries (LMICs) 
are higher than their fair-share contribu- 
tions under most equity considerations 
(see the figure). Accounting for cumula- 
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tive CO, emissions since 1990 favors North 
America and Europe over other regions 
that experienced much of their industrial 
growth in recent decades. By contrast, we 
see that capability- and needs-based alloca- 
tions (C- and N-) require substantial [PPP 
USD (2015) 657 billion to PPP USD (2015) 
1.581 trillion] mitigation finance flows to 
regions dominated by LMICs to bridge the 
gap between regional cost-effective invest- 
ment needs and fair-share con- 
tributions. As a practical mat- 
ter, and given differing notions 
of fairness, policy-makers may 
want to combine and weight 
equity considerations to find 
consensus on representing fair 
efforts in international negotiations. The 
adoption of more ambitious pledges in 
LMICs may depend on such consensus to 
guide stronger commitments to interna- 
tional financial flows. To aid this, we have 
developed a tool that allows for selection 
and weighting of equity considerations 
and corresponding indicators (https:// 


data.ece.iiasa.ac.at/fairmitigation/). 


IMPLICATIONS OF FAIRNESS 

Globally, climate mitigation investment 
gaps must be bridged to meet the agreed 
temperature goals of the Paris Agreement. 
How to finance global cost-effective cli- 
mate mitigation investment needs across 
regions is still debated. We show that near- 
term interregional financial flows con- 
sistent with respecting equity principles 
enshrined in the Paris Agreement are sub- 
stantial. Our proposal helps derive both 
the magnitude and directions of interre- 
gional flows that are necessary to incor- 
porate selected considerations of equity, 
providing a way to integrate principles 
of equity into established approaches for 
developing global cost-effective mitigation 
scenarios. Importantly, our results indicate 
that interregional flows must be scaled up 
no matter which combination of our se- 
lected equity considerations and indicators 
we consider. 

This work is consistent with other re- 
cent efforts to consider equity more ex- 
plicitly in modeled mitigation pathways 
through alternative methods, such as ap- 
plying specific social welfare functions, 
projecting degrowth in the Global North 
or more convergent growth futures, and 
explicitly accounting for regional popula- 
tions in poverty to safeguard and exclude 
them from mitigation efforts in the near 
term. In reflecting on recent critiques of 
the AR6 IPCC mitigation pathways, we 
show here that cost-effective mitigation 
pathways are consistent with CBDR-RC as 
enshrined in the Paris Agreement when eq- 
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uity considerations guide the allocation of 
necessary financial flows. Future modeling 
efforts that explicitly represent the finance 
sector and specific economic instruments 
can also help derive fair interregional 
flows endogenously. 

Our proposal and accompanying tool 
to derive fair-share near-term regional 
contributions to global mitigation invest- 
ment needs does not yet address the much- 


“,.-We find that distributive justice considerations 
in global climate mitigation will require substantial 


interregional finance flows.” 


needed investments to meet both regional 
climate adaptation needs and loss and 
damages, which are a priority for most 
LMICs (12). Moreover, we recognize that 
the current target of USD 100 billion per 
year promised by developed nations for 
climate action in developing nations under 
the Paris Agreement has been problematic. 
Among the key complexities of this tar- 
get is that it serves, at times, two distinct 
purposes: both to redistribute resources 
from developed to developing nations and 
to mobilize the scale of finance needed to 
achieve the Paris Agreement targets (73). It 
also does not clearly differentiate between 
adaptation and mitigation needs. 

Although our work exploring the magni- 
tude and direction of interregional finan- 
cial flows that must be mobilized under 
different considerations of equity can in- 
form negotiations, agreement around new 
targets for mitigation and adaptation cost 
support as well as international redistribu- 
tion will clearly need to deal with these is- 
sues. What is clear from our results is that 
even when considering responsibility for 
historical cumulative CO, emissions since 
1990 (R-2), which is an indicator most fa- 
vorable to regions of the Global North, the 
magnitude of annual interregional flows 
just for mitigation action alone must in- 
crease substantially to PPP USD (2015) 250 
billion to PPP USD (2015) 570 billion in the 
near term. 

Several caveats apply to our proposal. 
There are other justice considerations and 
intraregional distributional concerns that 
we do not consider, such as claims for com- 
mitted climate impacts or how local ben- 
efits and returns can be fairly distributed. 
Interregional financial flows, as implied by 
our paper, can be mobilized through a num- 
ber of different instruments, and each may 
have its own implications in terms of politi- 
cal feasibility and economic effectiveness. 
This is particularly important because re- 
gional investment risk profiles differ, which 


may also hinder financial flows. Finally, 
we do not address the wider issue of the 
distribution of macroeconomic costs and 
benefits resulting from the investments, 
which may change over time and require 
adaptive frameworks to motivate ambitious 
global mitigation strategies. A recent pro- 
posal toward fair-efforts metrics suggests 
one approach to account for true costs of 
mitigation technologies and nonmonetary 
benefits that may help advance 
this understanding (14). 

Clear institutional and regu- 
latory frameworks are needed 
to mobilize the magnitude 
of finance that is required to 
achieve globally agreed upon 
climate targets. Agreement on how to re- 
direct international and domestic finance 
toward urgent near-term mitigation in- 
vestments and climate adaptation efforts 
will be critical. Continued neglect of dif- 
ferentiated responsibilities, capabilities, 
and needs in the regional allocation of 
mitigation investment contributions risks 
lose-lose outcomes. Interregional coopera- 
tion will be necessary to move past this 
gridlock. Our work describes one pathway 
toward finding consensus by embedding 
distinct considerations of distributional 
justice in the derivation of “fair” regional 
contributions to globally cost-effective mit- 
igation investment needs. This can inform 
estimates of the support required to bridge 
interregional financing gaps. Progress here 
will serve as a clear signal to governments, 
industry, and nongovernment actors and 
will be crucial for building the neces- 
sary momentum in regions where finance 
is scarce. 
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Data-based tools can 
prevent crowd crushes 


The formation of mass crowds, in the 
absence of external danger, should not 
pose a threat to human lives. However, 
deadly crowd crush incidents, such as the 
tragedy in Seoul on 29 October (1), continue 
to occur. The Seoul incident was reminis- 
cent of several similar recent catastrophes 
(2), all of which involved an unpredictable 
crowd size, a restrictive space with no 
evacuation routes, unrestricted entry, and 
the absence of real-time density monitor- 
ing. The science of crowds can help to pre- 
vent future crowd crush disasters. 

The principles of crowd turbulence are 
well understood (3), and the connection 
to crowd density is clear. When a crowd 
of several thousands of humans reaches 
critical density levels of 8 to 10 persons/m?, 
the crowd acts like a continuous body (or 
fluid). Individuals lose control of their 
movement, and any small turbulence could 
magnify and propagate through the crowd, 
potentially creating instability, falls, and 
injuries. In such a scenario, there is little 
that individuals can do once trapped. The 
description of such incidents as a crowd 
“stampede”—implying misbehavior on 
the part of the people—obscures the root 
causes and hinders prevention. 

Solutions are available. Crowd manage- 
ment needs to be recognized as a specialty 
and should be outsourced to professionals 
within the industry who have the exper- 
tise and tools to conduct estimation of the 
expected crowd size, risk analysis, and— 
most important—real-time monitoring and 
control. In doing so, the role of computer- 
vision and artificial intelligence methods in 
real-time estimation of flow and density is 
critical (4). Although an element of unpre- 
dictability may always exist in terms of the 
exact crowd size (especially in the case of 
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events with no ticket sales), these tools can 
provide the ability to observe early signs of 
overcrowding and take appropriate actions. 

It is important that interventional mea- 
sures are implemented before the critical 
density levels and crowd instability are 
reached. Two metrics are key in making 
such determinations: Crowd density in a 
restricted area gives insight into the cur- 
rent state of the crowd, and the crowd 
flow provides information about the 
input rate, and thereby, the rate at which 
density is increasing. These two metrics 
together can provide an estimate of the 
time that it may take for the crowd to 
reach critical density levels. 

A range of methodologies developed 
by computer vision scientists apply artifi- 
cial intelligence to infer these important 
parameters from real-time surveillance 
video footage. Real-time estimation of 
crowd sentiment, using indicators such as 
social media activity (5, 6), could further 
inform real-time interventional decisions. 
Crowd crushes are preventable, and the 
required tools and knowledge do exist. 
They only need to be implemented. 
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Co-management of Chile’s 
escaped farmed salmon 


Salmon escapes from aquaculture farms 
taint Chile’s reputation as a sustainable global 
provider of salmonid products (7). Massive 
salmon escapes from sea-based farms in 
Patagonia are frequent; escapes occur 1 to 16 
times per year, and about 5 million salmon 
escaped between 2010 and 2020 (2). Escapes 
result in socio-environmental risks (2, 3) and 
may increase the likelihood that salmon will 
become invasive in Patagonia (2, 4). Adverse 
effects of aquaculture salmon on native biodi- 
versity have been well documented, including 
the declines of native and threatened fresh- 
water fish populations (2, 5, 6). Traditional 
livelihood fishing activities of local commu- 
nities have also been disrupted by escaped 
salmon (6). There is a clear need to reduce 
these impacts on Patagonia, a valuable global 
natural reserve (2). 

In the past, escapes were considered the 
private property of aquaculture companies. 
Legislation compelled them to recapture 
at least 10% of escaped fish and prohibited 
capture by other parties, including bycatch 
(7). However, the companies only recaptured 
approximately one-third of escapees (8), 
whereas artisanal fishers have been respon- 
sive, mobile, and efficient in capturing and 
selling escapees in informal markets (6, 8), a 
practice that remains illegal (7). 

On 25 October, Chile’s congressional com- 
mittee passed a bill, which will be rectified 
by Congress, deprivatizing aquaculture 
salmon and trout escapes in bycatch and 
implementing tougher sanctions for the 
companies responsible for the escapes (7). 
The approved bill includes a reeommenda- 
tion by scientists and conservationist enti- 
ties to allow artisanal and recreational fish- 
ers to increase fishing pressure on escaped 
salmon and trout and indicates that guide- 
lines for legally selling captured salmon 
will be defined in future bylaws (7). 

Co-management of Chile’s salmon escapes 
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Analyzing and monitoring crowds can help 
to prevent deadly crowd crush incidents. 


by aquaculture companies, recreational, and 
artisanal fishers could increase the recap- 
ture rate, reducing the escapees’ negative 
socio-environmental impacts on Patagonia. 
Norway, the largest salmon producer in the 
world (9), passed legislation in 2015 requir- 
ing license holders for salmon aquaculture 
to manage and fund efforts to recapture 

as many escapees as possible (70). Escaped 
salmon that elude recapture are subse- 
quently managed as wild salmon for har- 
vest (11). These changes have resulted in a 
decrease in both the number of escaped fish 
and the proportion of recaptured salmon 
caught on Norwegian coasts and fjords using 
bag nets and bend nets (11, 12). 

Co-management could also benefit local 
communities, if escaped fish are safe to 
eat. Some escaped fish may have been 
treated with antibiotics or other chemi- 
cals. Therefore, testing for such contami- 
nants should be mandatory before human 
consumption. 

Although the approved bill is a step in 
the right direction for the management of 
salmon and trout escapes, legislators and 
stakeholders should work toward a holistic 
approach that includes a long-term moni- 
toring program (2) and a complete charac- 
terization of the socio-ecological systems of 
Patagonia affected by these escape events. 
The recent period of profound political 
changes in Chile could allow for a paradigm 
shift in conservation strategies. 
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Long-term solutions for 
China’s heat and drought 


Since July, China has experienced the most 
severe heatwave ever recorded in the world, 
compounded by the worst drought since 
record-keeping began in the country six 
decades ago (1). The areas hardest hit by 
the combined heatwave and drought fall 
along the Yangtze River basin, a populous 
and immense industrial economic belt. 
Impacts range from disruptions in global 
supply chains (2) to reductions in habitat 
for critically endangered species such as the 
Yangtze finless porpoise. Long-term antici- 
patory policies are needed to mitigate these 
far-reaching effects, in China and beyond, 
yet the current policy approach favors 
short-term solutions that are ill-equipped 
to tackle persistent climatic extremes in an 
era of climate catastrophe. 

Poyang Lake, China’s largest freshwater 
lake, located along the Middle Reaches of 
the Yangtze River, is illustrative of the dev- 
astation triggered by this summer’s extreme 
weather. Suffering from more than 100 
days of drought and counting, the lake has 
shrunk by nearly 90% (3). Numerous rare 
fishes, porpoises, and more than 700,000 
migratory birds have been affected (3). 

The number of Yangtze finless porpoises, 
for example, was about 700 in 2021 (4), 

but few were found this October, mainly 
due to grounding and food shortages trig- 
gered by the drought (5). Poyang Lake has 
been listed as a wetland of international 
importance since 1992 (6) and provides key 
habitat for overwintering migratory birds, 
including white cranes and red-crowned 
cranes, yet the ongoing drought jeopardizes 
this critical habitat (3). 

China’s drought is extreme, but not 
exceptional. Since July, extreme heatwaves 


and persistent drought have swept 
across Europe, North America, and Asia 
(7). Climate change has led to a 20-fold 
increase in the likelihood of Northern 
Hemisphere droughts in 2022, with the fre- 
quency, spatial extent, and severity antici- 
pated to continue to increase throughout 
the century (8). As drought and extreme 
weather events become increasingly 
endemic, policy responses must shift from 
short-term reactive strategies to long-term 
proactive measures. 

Establishing a drought early monitor- 
ing and warning system that operates 
at timescales longer than 2 years and at 
regional rather than local levels is para- 
mount. Better drought information and 
predictions would permit water regulation 
among different sectors and industries 
across larger geographies and timeframes. 
Drought impacts can be mitigated by har- 
nessing the full potential of engineering 
projects, such as transferring water to lakes 
and reservoirs above Poyang Lake to avoid 
ecological catastrophe (9). Nature-based 
solutions, such as wetland restoration, 
will help mitigate long-term impacts to 
critically endangered species. Protecting, 
managing, and using low-lying, shallow 
lake areas can provide critical habitat for 
species during low-flow periods. Through a 
range of engineered and nature-based solu- 
tions, drought-stricken regions can develop 
long-term approaches to mitigating extreme 
weather events in the age of climate change. 
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Organic geochemistry in Jezero crater 
he Perseverance rover has investigated the floor 
of Jezero crater on Mars, finding that it consists 
of igneous rocks that were modified by reactions 
with liquid water (aqueous alteration). Scheller et al. 
used the rover to perform Raman and fluorescence 
spectroscopy of rocks at two locations within the crater. 
They identified the presence of organic molecules, including 
aromatics with one and two benzene rings. The presence 
of perchlorates allowed the authors toset a limit of more 
than 2 billion years for the last time water filled the crater. 
Carbonates and sulfates were also found. The results 
demonstrate that the rocks in Jezero crater contain 
a record of ancient organic geochemistry. —KTS 


Science, abo5204, this issue p. 1105 


27 he Perseverance rover, pictured, has detected organic 
; - * cg 
and inorganic molecules within surface rocks on Mars. 


PR IA 
A potent(iator) 
treatment for ARDS 


Pneumonia can result in the 
development of acute respira- 
tory distress syndrome (ARDS), 
which is characterized by 
increased permeability of the 
lung's endothelial barriers to 
fluid and protein. Targeted treat- 
ments for ARDS are limited. 
Erfinanda et al. noted down- 
regulation of the cystic fibrosis 
transmembrane conductance 
regulator (CFTR) in pneumonia, 
resulting in increased endothe- 
lial permeability. The authors 
identified the mechanisms 
involved and showed that the 
CFTR potentiator ivacaftor 
stabilized CFTR abundance and 
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rescued CFTR function in vitro. 
In wild-type mice, treatment 
with ivacaftor before or 6 hours 
after initiation of Streptococcus 
pneumoniae pneumonia 
resulted in improved barrier 
function and CFTR abundance 


and reduced signs of lung injury, 


suggesting a role for ivacaftor 
in the treatment of ARDS in 
humans. —MN 


Sci. Transl. Med. 14, eabg8577 (2022). 


Protection from PAH 
Pulmonary arterial hyperten- 
sion (PAH) is partially driven 
by pulmonary arterial vascular 
smooth muscle cell (PAVSMC) 
proliferation. Shen et al. inves- 
tigated how PAH pathogenesis 
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was regulated by TSC2, an 
inhibitor of cell growth—promot- 
ing signaling. TSC2 levels were 
reduced in pulmonary arter- 
ies and PAVSMCs from PAH 
patients, and TSC2 deficiency 
in smooth muscle in mice 
resulted in PAH. Restoring TSC2 
abundance in rodent disease 
models improved lung function, 
suggesting that increasing TSC2 
levels in PAVSMCs could reverse 
PAH, which currently lacks 
effective treatments. -WW 

Sci. Signal. 15, eabn2743 (2022). 


—— 


A path for a high plateau 
The timing of the uplift of the 
Qinghai-Tibet plateau influ- 
ences how we interpret the 


Y 


atmospheric circulation and 
environment tens of millions of 
years ago. Miao et al. analyzed 
pollen records from a new sedi- 
mentary record to constrain the 
elevation of the northern part of 
the plateau. The authors found 
that the plateau likely obtained 
its current or higher elevation 
in the late Miocene, roughly 10 
million years ago. —BG 

Science, abo2475, this issue p. 1074 
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Al masters Diplomacy 

The game Diplomacy has been 

a major challenge for artificial 
intelligence (Al). Unlike other 
competitive games that Al has 
recently mastered, such as 
chess, Go, and poker, Diplomacy 


science.org SCIENCE 


PHOTO: NASA/JPL-CALTECH/MSSS 


CHRISTIAN CHARISIUS/PICTURE ALLIANCE/GETTY IMAGES 


PHOTOS (LEFT TO RIGHT) IDRUS-SAIDI ET AL.; 


cannot be solved purely 
through self-play; it requires 
the development of an agent to 
understand other players’ moti- 
vations and perspectives and to 
use natural language to negoti- 
ate complex shared plans. The 
Meta Fundamental Al Research 
Diplomacy Team (FAIR) et al. 
developed an agent that is able 
to play the full natural language 
form of the game and demon- 
strates performance well above 
the human average in an online 
Diplomacy league. The present 
work has far-reaching implica- 
tions for the development of 
cooperative Al and language 
models for communication with 
people, even when interactions 
involve a mixture of aligned and 
competing interests. —YS 
Science, ade9097, this issue p. 1067 


Many metal morphologies 
Metals will easily dissolve in 
liquid gallium at elevated tem- 
peratures and then precipitate 
out of solution when cooled 
down. However, the high sur- 
face tension of the liquid metal 
makes it very difficult to extract 
these crystals. Idrus-Saidi et 

al. developed an extraction 
method achieved by applying a 
voltage to the liquid metal solu- 
tion while vacuum filtrating. The 
resulting crystals can have intri- 
cate morphologies, with some 
looking similar to snowflakes. 
This strategy should produce 

a variety of morphologies for 


Scanning electron microscopy 
images of microscopic zinc crystals 
formed from a zinc-gallium alloy. 
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metal particles across a wide 
range of different elements and 
alloys. —BG 

Science, abm273]1, this issue p. 1118 


Biasing vinyl cation 
insertions 


Forming a vinyl cation leaves 
a doubly bonded carbon 
center stranded with just six 
electrons instead of eight. 
Because of this deficiency, 
these intermediates tend to 
react extremely quickly and 
indiscriminately with any other 
bond in the vicinity. Nistanaki 
et al. report that a negatively 
charged, phosphorus-based 
catalyst can engage a vinyl 
cation just tightly enough to 
steer an ensuing intramolecu- 
lar carbon—-hydrogen insertion 
reaction toward just one of 
two mirror-image products. 
Isotopic labeling and compu- 
tational analysis elucidate the 
likely mode of induction. —JSY 
Science, ade5320, this issue p.1085 


Resilient in the long term 
Tuberculosis caused by 
Mycobacterium tuberculosis is 
a persistent, sometimes life- 
long infection that needs long 
courses of multiple antibiot- 
ics to treat. Not surprisingly, 
antibiotic resistance is rife. Ina 
large sample of whole-genome 
data from clinical isolates, Liu 
et al. repeatedly observed an 
elongated phenotype showing 
rapid regrowth after typical 
regimens of antibiotic treat- 
ment. Signals of positive 
selection pointed to an essen- 
tial transcriptional regulator 
(resR) and specific intergenic 
regions, which act in concert to 
regulate growth. Up to 10% of 
strains from high-tuberculosis- 
burden countries showed fixed 
mutations in these regions. 
Mutations across this cascade 
are associated with antibiotic 
treatment failure and are 
precursors to the emergence of 
classical antibiotic resistance. 
—CA 

Science, abq2787, this issue p. 1111 


CLEAN ENERGY 
Faster is better 


Edited by Caroline Ash 
and Jesse Smith 


educing greenhouse gas emissions by switching from 

fossil fuels to noncarbon energy sources is necessary 

to mitigate climate change, even though the transi- 

tion will entail its own set of carbon dioxide emissions 

from energy and materials use. Lesk et al. show that 
the more quickly and aggressively we act to realize that 
transition, the better. They found that the energy needed to 
deploy renewable capacity sufficient for the more ambi- 
tious goal of limiting warming to 1.5°C is only 20% of that 
of a more gradual transition for a 2.0°C goal and only about 
10% of that which will be needed by 2100 if current policies 
remain unchanged and warming is 2.7°C, as projected. —HJS 
Proc. Natl. Acad. Sci. U.S.A. 119, e2123486119 (2022). 


Rapid deployment is key to minimizing the energy costs of the transition 


to a renewable energy future. 


All-seeing tiger 
shark’s eye 


Seagrass ecosystems are globally 
important because of their photo- 
synthetic capacity to draw down 
and store carbon in the underly- 
ing sediment. Unfortunately, 
seagrass meadows are highly 
vulnerable to damage, and we 
only have a hazy idea of their 

true extent. Knowing that tiger 
sharks are faithful to seagrass 
ecosystems, Gallagher et al. 
attached cameras and/or pop-up 
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satellite tags to 15 individual tiger 
sharks and exploited the animals’ 
migratory behavior to monitor the 
extent of seagrass ecosystems 
around the Bahamas. Shark 
surveys were complemented by 
Landsat 8 images and human 
diver surveys, which also col- 
lected sediment cores for stored 
carbon estimation. Tiger sharks 
reached areas off-limits to human 
SCUBA divers, and seagrass 
could be observed growing at 
depth in these clear waters. The 
area covered by seagrass mead- 
ows was estimated to be 57,337 to 
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92,524 square kilometers, which 
roughly equates to 19.2 to 26.3% 
of the carbon buried in seagrass 
sediments worldwide. —CA 

Nat. Commun. 13, 6328 (2022). 


IMMUNOLOGY 
Old, particular, 
and exhausted 


The immune system tends to 
become dysfunctional with age, 
resulting in increased chances 

of infection, autoimmunity, and 
cancer. This process, known 

as immunosenescence, is also 
associated with remodeling 

of lymphoid tissues. Ural et al. 
examined a cohort of human 
organ donors and found that 
lymph nodes associated with the 
lungs accumulated particulates 
derived from environmental 
pollutants as the individual aged. 
These particulates are prefer- 
entially taken up by a subset of 
CD68*CD169° macrophages, 
which subsequently become 
functionally impaired and develop 
altered cytokine production. 
Moreover, these lymph nodes, 
which are strikingly black in color, 
display progressive anatomical 
disruption. The selective accu- 
mulation of air pollutants may 

be responsible, at least in part, 
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for some of the immune defects 
experienced in late life. —STS 
Nat. Med. 10.1038/ 
s41591-022-02073-x (2022). 


GENETIC RISK 
Family history plus 
risk scores 


With the increasing use of 
genome-wide association studies, 
interest has turned to polygenic 
risk scores (PRSs), which esti- 
mate the risk of genetic disease 
based on counts of disease- 
associated alleles in a genome. 
Given the costs of generating 
genetic data, there has been 
debate about whether PRSs are 
any better than family history 

at predicting disease risk—after 
all, family history captures both 
genetic and nongenetic factors. 
Mars et al. used data from more 
than 300,000 individuals in the 
FinnGen research project to com- 
pare the relative contributions of 
family history and PRS against 
disease risk. Across 24 diseases, 
the authors found that although 
these measures capture differ- 
ent information, their joint use 
improves disease prediction. The 
relative differences in predictive 
power of the two metrics differ 
across diseases, with implications 
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for understanding how genetic 
architecture contributes to dis- 
ease. —CNS 
AJHG 10.1016/ 
j.ajhg.2022.10.009 (2022). 


QUANTUM OPTICS 
Quantum frequency 
conversion 


An optical-based quantum 
network connects individual 
nodes, transmitting informa- 

tion encoded in single photons 
between senders and receivers. 
However, because there are many 
different quantum platforms 
being developed that do not 
necessarily operate at same 
wavelength (even the nominally 
same platform can drift in wave- 
length), there is a need to convert 
the wavelength of the photons 
for compatibility. Bonsma-Fisher 
et al. demonstrated four-wave 
mixing in a photonic crystal 
fiber—based system that has the 
properties for efficiently convert- 
ing photons over a broad range of 
wavelengths. Demonstrating the 
effect with communication-band 
photons at 1551 nanometers, 

the authors were able to convert 
single photons to any wavelength 
between 1226 and 1408 nanome- 
ters. Such wavelength conversion 


The insect apocalypse 


erious declines in insect numbers across species 

and systems have been reported. However, some 

studies show increases or equivocal results, 

leading to debate about what is really happen- 

ing. Harvey et al. reviewed the effects of climate 
change, identifying the large number of threats that 
insects are currently facing and confirming that general 
patterns of decline are occurring. Overall, these threats 
fall into two categories: long-term responses to change 
and the acute impacts of extreme events. These threats 
are exacerbated by other anthropogenic factors, includ- 
ing habitat loss and agrochemical pollution. The authors 
conclude that overall damage to insect populations is 
high, and if the declines are not reversed, the impact on 
human activities will be significant if not catastrophic. 
—SNV Ecol. Monogr. 10.1002/ecm.1553 (2022). 


Global insect decline is caused by a combination of factors, 
including climate change, pollution (such as pesticides, which 
killed the bees pictured), and habitat loss. 


capability will be useful for linking 

up different quantum compo- 

nents across a network. —ISO 
Phys. Rev. Lett. 129, 2303603 (2022). 


PHYSICS 
Avan der Waals 
quantum sensor 


Solid-state defects, such as nitro- 
gen vacancy centers in diamond, 
have been used as local sensors 
of magnetic field, pressure, and 
temperature. These sensors are 
typically embedded in three- 
dimensional structures, making it 
tricky to optimize the sensor-sam- 
ple interface. Healey et al. instead 
used defects in hexagonal boron 
nitride (ABN), a van dear Waals 
material that can be easily exfoli- 
ated. After creating boron vacancy 
centers in thin layers of hBN by 
ion irradiation, the researchers 
interfaced these layers directly 
with the sample. The technique 
enabled them to simultaneously 
image magnetic field and temper- 
ature maps of the van der Waals 
magnet CrTe.. The sensitivity and 
resolution of the method might 
be further improved by optimizing 
the materials and the thickness of 
the sensing layer. —JS 
Nat. Phys. 10.1038/ 
$41567-022-01815-5 (2022). 
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MEDICINE 
Clinical benefits 
of psychedelics 


Classic psychedelic drugs, 
such as psilocybin and N,N- 
dimethyltryptamine (DMT), 
have been reported to have 
sustained benefit for various 
neuropsychiatric disorders and 
could have broader applica- 
tions. What will it take for 
classic psychedelic drugs to 
be used in clinical settings? In 
a Perspective, Schindler and 
D'Souza discuss the evidence for 
the clinical use of psychedelics 
and their proposed mechanism 
of action to induce neuroplasti- 
city. They also highlight the key 
issues to be overcome, such 
as understanding dosing and 
attendant safety questions and 
the problems with ensuring that 
clinical trial participants do not 
know they are taking the active 
drug. Nonetheless, psychedel- 
ics, either as single agents or as 
an assist, offer an opportunity 
to develop new therapeutic 
avenues for difficult-to-treat 
disorders, including addiction 
and depression. —GKA 

Science, abn5486, this issue p.1051 


BATTERIES 
Progress and challenges 
for electrolytes 


Compared with the development 
of new cathodes and anodes, 
there has been less of a focus on 
the development of electrolytes. 
However, it is the electrolyte 
that controls the flow of ions 
and charges, and it is the only 
component in intimate contact 
with all the others. With the push 
toward higher energy and power 
densities, electrolytes are also 
involved in kinetically formed 
interphases that aid in the 
stability of a battery but can also 
hamper its operation. In a review, 
Meng et al. captures a number 
of trends that have emerged in 
the development of advanced 
battery electrolytes. —MSL 
Science, abq3750, this issue p. 1065 
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SYSTEMS BIOLOGY 
Doing the math on the 
central dogma 


Gene expression can in theory 
be modulated at the level of 
transcription or translation, but 
both of these processes have 
constraints that complicate 
prediction of their outputs. To 
obtain a better quantitative 
understanding of the control 
of gene expression in bacteria, 
Balakrishnan et al. measured 
promotor on-rates, messenger 
RNA abundance, and protein 
abundance for more than 1500 
genes in the bacterium 
Escherichia coli under many dif- 
ferent growth conditions. Protein 
abundance largely reflects gene 
promoter on-rates and tran- 
scription, but has to comply with 
general constraints that keep the 
protein concentration constant 
and limit the number of ribo- 
somes—and thus translational 
capacity. The authors propose a 
balancing of transcription with 
translation through Rsd, a factor 
that controls the availability of 
RNA polymerase. Their results 
may be useful in the design of 
synthetic circuits in bacteria and 
the prediction of their behavior 
in various growth conditions. 
—LBR 

Science, abk2066, this issue p. 1066 


EVOLUTIONARY ECOLOGY 
Selection by modern 
agriculture 


Intensive agriculture creates 
extreme environmental changes, 
including soil disturbance, water 
and nutrient addition, and appli- 
cation of chemical pesticides. 
Kreiner et al. documented how 
these extreme environments 
have driven changes to a native 
North American agricultural 
weed, common waterhemp 
(Amaranthus tuberculatus), 
using genomic data from paired 
agricultural and natural settings, 
as well as historic herbarium 
samples from both habitat types 
(see the Perspective by Waselkov 


and Olsen). The authors found 
that eastward range expan- 
sion of a distinct southwestern 
variety introduced new genetic 
diversity and facilitated rapid 
adaptation to agricultural 
environments in genes related to 
growth, environmental tolerance, 
and herbicide resistance. —BEL 
Science, abo7293, this issue p. 1079; 
see also ade4615, p.1053 


COMPUTER SCIENCE 
Machine learning systems 
can program, too 


Computer programming 
competitions are popular 
tests among programmers 
that require critical thinking 
informed by experience and 
creating solutions to unforeseen 
problems, both of which are 
key aspects of human intel- 
ligence but challenging to mimic 
by machine learning models. 
Using self-supervised learn- 
ing and an encoder-decoder 
transformer architecture, Li et al. 
developed AlphaCode, a deep- 
learning model that can achieve 
approximately human-level 
performance on the Codeforces 
platform, which regularly hosts 
these competitions and attracts 
numerous participants world- 
wide (see the Perspective by 
Kolter). The development of 
such coding platforms could 
have a huge impact on program- 
mers’ productivity. It may even 
change the culture of program- 
ming by shifting human work 
to formulating problems, with 
machine learning being the main 
one responsible for generating 
and executing codes. —YS 
Science, abq1158, this issue p. 1092; 
see also add8258, p. 1056 


CHEMICAL GENETICS 
Beyond traditional 
drug design 


Drug design prioritizes com- 
pounds likely to cross the cell 
membrane, leading to rules 
governing the size, polarity, and 
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rigidity of chemical structures. 
However, there is increased 
interest in linking two chemo- 
types to allow greater selectivity 
for one target or to associate two 
targets. Lou et al. focused ona 
bitopic inhibitor, Rapalink-1, of 
the signaling protein mTOR that 
is active in vivo even though it is 
much too large to be considered 
drug like (see the Perspective 
by Lokey and Pye). The 
authors used chemical genetic 
approaches to show that the 
activity of Rapalink-1 depends on 
interferon-induced transmem- 
brane proteins (IFITMs). They 
went on to show that IFITMs 
promote the cellular uptake 
of diverse linked chemotypes 
and to explore the role of linker 
length on the uptake pathway. 
This work may guide the devel- 
opment of bitopic drugs. —VV 
Science, abl5829, this issue p. 1097; 
see also adf4412, p.1054 


PIEZOELECTRICS 
Get the lead out 


Piezoelectric materials convert 
an electrical signal into mechani- 
cal strain, making them vital as 
actuators and sensors. However, 
the best materials currently in 
use contain lead. Huangfu et al. 
found that doping a potassium— 
sodium niobium oxide with 
strontium improved the piezo- 
electric properties. This material 
had a high strain response 
at electric fields similar to its 
leaded counterparts and could 
potentially be a replacement for 
some lead-based piezoelectrics. 
—BG 

Science, ade2964, this issue p. 1125 


VACCINES 

Boosting up flu-specific 
lung T cells 

Despite the rapid embrace of 
messenger RNA (mRNA) vac- 
cine technologies catalyzed by 
the COVID-19 pandemic, the 
extent of the resident memory 


T cell (TRM) response in the 
lung after different routes of 
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mRNA vaccination remains 
uncertain. By immunizing mice 
with a self-amplifying mRNA 
vaccine for influenza nucleopro- 
tein encapsulated in dendritic 
nanoparticles, KUnzli et al. 
assessed how various routes of 
vaccine administration affected 
CD4 and CD&8 TRM responses. 
Whereas intramuscular prime- 
boost immunizations were 
sufficient to induce respiratory 
TRM, an additional intranasal 
boost further expanded both 
circulating and lung-resident 
CD4 and CD8 TRMs. These find- 
ings indicate that optimization 
of the routes of administration 
and sequence of mRNA vaccine 
doses can elicit robust antibody 
and lung TRM responses to 
pathogenic respiratory viruses. 
—IRW 


Sci. !mmunol. 7,eadd3075 (2022). 
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BATTERIES 


Designing better electrolytes 


Y. Shirley Meng*, Venkat Srinivasan*, Kang Xu* 


BACKGROUND: The electrolyte is an indispens- 
able component in every electrochemical device, 
including lithium-ion batteries (LIBs). It phys- 
ically segregates two electrodes from direct 
electron transfer while allowing working ions 
to transport both charges and masses across 
the cell so that the cell reactions can proceed 
sustainably. Whether powering our phones, 
driving our vehicles, or harvesting the inter- 
mittent energy from solar and wind farms, 
electrolytes in these LIBs determine how fast 
and how many times our devices can be re- 
charged or how efficiently energy can be cap- 
tured and stored over the grid. Occasionally, 
when an LIB is pushed away from the designed 
electrochemistry pathways by various factors 
such as excessive heat, mechanical mutilation, 
or internal short circuits induced under ex- 


treme charging conditions, electrolytes are also 
responsible for the fire and explosion accidents 
that we read about in the news. 

The electrolyte is the most unique compo- 
nent in a battery. Because it must physically 
interface with every other component, it is 
obligated to satisfy numerous constraints 
simultaneously, including rapidly transporting 
ions and masses, effectively insulating elec- 
trons, and maintaining stability toward the 
strongly oxidative cathode and strongly reduc- 
tive anode. Historically, the electrolyte-anode 
interfacing was the last piece of the puzzle to 
complete modern LIB chemistry. 


ADVANCES: The commercial success of LIBs 
has attracted intense interest and invest- 
ments in electrolyte research, which led to 


Electrolytes and the associated interphases play the central role in supporting diversified battery chemis- 
tries. On the anode side (left), the electrolyte must form an interphase that prevents graphitic anode from 
exfoliation, tolerates the drastic volume changes of a silicon electrode, and suppresses the growth of a dendritic 
form of lithium metal. On the cathode side (right), an interphase is critical in preventing the irreversible 
reactions with electrolytes, maintaining the lattice structure of transition metal oxides, suppressing the cross-cell 
shuttling of polysulfide species, and assisting the complicated triphasial reactions of an air-cathode. In all of these 
scenarios, interphases must enable ionic transport while insulating electronic transport. 
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the identification of interphases as the key 
component responsible for the stable and re- 
versible operations of cathode and anode mate- 
rials far beyond the thermodynamic stability 
limits of any known electrolyte. These inter- 
phases, often with nanometer thickness, are 
formed by electrolytes in a self-limiting de- 
composition process, and they ensure fast 
rates of charging and discharging, maxi- 
mum voltage, and reversibility of LIBs. In 
the past three decades, the chemistry, mor- 
phology, and formation mechanisms of in- 
terphases have been thoroughly investigated. 
Researchers have learned how such inter- 
phases are structured and what key ingre- 
dients they comprise and, most importantly, 
how to tailor them using electrolyte engineer- 
ing. Today, it is widely accepted that design- 
ing better electrolytes also implies designing 
the associated interphases for the electrode 
materials. Although the accurate prediction 
of interphasial chemistry remains difficult, 
and key fundamental properties of inter- 
phases such as the rate and mechanism of ion 
transport across interphases are still unknown, 
the structure of the ion solvation sheath has 
been identified as an effective tool that di- 
rects the formation process of interphases. 
Such knowledge has been driving a series of 
new electrolyte concepts for emerging bat- 
tery chemistries. 


OUTLOOK: Efforts are being made to develop 
battery chemistries that promise high energy 
density, rapid charging, low cost, high sustain- 
ability, and independence from elements or 
materials of high geopolitical or ethical risks. 
Each individual chemistry may demand a uni- 
que electrolyte and corresponding interphase, 
but a few universal trends emerge: (i) a super- 
concentration of salts is used to leverage 
unusual properties arising from the altered 
ion-solvation structures; (ii) both polymeric 
and inorganic materials are used to solidify 
electrolytes so that the aggressive lithium- 
metal anode can be harnessed with higher 
safety; (iii) efforts are made to identify the 
most effective interphasial ingredients so 
that an interphase of singular composition 
can be designed and artificially applied; (iv) 
liquefied gaseous components are used to 
expand the low-temperature limits of con- 
ventional electrolytes; and (v) unusual electro- 
chemical behaviors are explored by confining 
ion-solvation sheaths in nano- or sub-nano 
environments. & 
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BATTERIES 


Designing better electrolytes 


Y. Shirley Meng'2*, Venkat Srinivasan?**, Kang Xu°4* 


Electrolytes and the associated interphases constitute the critical components to support the emerging 
battery chemistries that promise tantalizing energy but involve drastic phase and structure 
complications. Designing better electrolytes and interphases holds the key to the success of these 
batteries. As the only component that interfaces with every other component in the device, an 
electrolyte must satisfy multiple criteria simultaneously. These include transporting ions while insulating 
electrons between the electrodes and maintaining stability against electrodes of extreme chemical 
natures: the strongly oxidative cathode and the strongly reductive anode. In most advanced batteries, 
the two electrodes operate at potentials far beyond the thermodynamic stability limits of electrolytes, so 
the stability therein has to be realized kinetically through an interphase formed from the sacrificial 


reactions between electrolyte and electrodes. 


ll electrochemical energy devices (fuel 
cells, batteries, and supercapacitors) oper- 
ate by separating charges (J), and the 
electrolyte is the indispensable compo- 
nent to enforce such charge separation 
(2, 3). The electrolyte physically segregates 
two electrodes from direct electron transfer 
while allowing working ions to transport both 
charge and mass across the cell so that the 
cell’s reactions can proceed sustainably. In a 
rechargeable battery, a qualified electrolyte 
not only conducts ions while insulating elec- 
trons (a property combination known as “elec- 
trolyte nature”), but also must remain inert to 
the cell reactions. The electrolyte dictates how 
fast the cell reaction can proceed (power den- 
sity) and how many times a battery can be 
charged and discharged (reversibility). Although 
an electrolyte does not directly determine the 
energy of a battery (which is instead dictated 
by both electrodes), it does so indirectly by 
allowing or disallowing an electrode depend- 
ing on whether electrochemical stability can 
be achieved between it and the electrolyte. 
To maximize energy density, the electrodes 
in advanced batteries are often pushed to 
operate at extreme potentials, where no known 
electrolyte is thermodynamically stable. In ad- 
vanced batteries such as lithium-ion batteries 
(LIBs), the stability between the electrolyte 
and electrodes (and consequently the revers- 
ibility of the battery) must be achieved kineti- 
cally. Thus, trace amounts of electrolytes are 
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sacrificially decomposed, and their solid products 
are deposited on the electrode surfaces, forming 
a protection against further decomposition. 
This protection layer in batteries is called 
“interphase” (4). Interphase formed in this 
self-limiting manner is almost always identi- 
fied on anode surfaces in LIBs, where it is 
referred to as a solid electrolyte interphase 
(SEI). When interphase occurs on a cathode 
surface, it is known as a cathode electrolyte 
interphase (CEI) (5), which is less definitive 
and complicated by several factors (3). First, 
most transition-metal oxide cathode materials 
already carry a native passivation layer of 
Li,COs, which in turn reacts with acidic electro- 
lytes in the battery environment through pure 
chemical pathways. Second, further evolution 
occurs through electrochemical pathway, lead- 
ing to eventual CEI chemistry. Finally, the 
surface of the cathode material might also be 
involved in the evolution, creating an inter- 
mediate phase and thus smearing the clear 
definition of where CEI starts. Nevertheless, 
the electrolytes and interphasial chemistries 
are closely associated. Because the electrolyte 
is the only component in a battery that is in 
contact with every other component, design- 
ing better electrolytes implies tailoring and 
balancing a host of properties, ranging from 
bulk (e.g., ion solvation and transport and ex- 
tended liquid structure) to interfacial struc- 
ture and stability (e.g., preferential assembly 
and orientation of ions and molecules at 
Helmholtz planes and corrosion suppression) 
to interphasial chemistry and morphology. Al- 
though the irreversible reactions leading to 
interphase formation mainly occur during 
the initial activation cycles of batteries, the 
interphases thus formed will dictate the power, 
energy, and reversibility of the battery during its 
entire service life. 

Differing from the two-dimensional inter- 
face in classical electrochemistry, interphase 
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constitutes an independent, three-dimensional 
existence with special chemistry and morphol- 
ogy. With nanometer thickness (~2 to 20 nm) 
and extremely high sensitivity, interphases have 
been the least understood component in re- 
chargeable batteries until recently, when various 
advanced characterization and computational 
techniques were developed to shed light on 
this critical component (6). 


LIBs enabled by interphases 


The energy density and rechargeability of LIBs 
rely on interphases (Fig. 1) (3). Because the 
potential of a graphitic anode in a LIB (~ 0.01 V 
versus Li) resides far beyond the thermody- 
namic stability window of the electrolyte, as 
marked by the highest occupied molecular 
orbitals (HOMOs) and lowest unoccupied molec- 
ular orbitals (LUMOs), the reversible lithiation/ 
de-lithiation chemistry relies on the protection 
from interphases (7). The electrolytes used in 
state-of-the-art LIBs vary with manufacturers 
and the cathode chemistry used; however, they 
almost exclusively follow a skeleton composi- 
tion consisting of lithium hexafluorophosphate 
(LiPF,) dissolved in a mixture of carbonate 
ester solvents, which is the result of balancing 
the requirements from every battery compo- 
nent (8), especially the graphitic anode at low 
potential (~0.01 V versus Li*/Li) and transition- 
metal oxides or phosphates cathodes at high 
potential (~3.5 to 4.5 V versus Li*/Li). It is now 
widely accepted that the SEI in LIBs mainly 
comes from the reductive decomposition of 
carbonate solvents and salt anions. These spe- 
cies are usually arranged in stratified structures, 
with the innermost layer being more inorganic 
(LiF and Li,O) and the outer layer more or- 
ganic (semicarbonates, oxalates, alkoxides, and 
polymers). Of special interest is the unbalanced 
contribution to SEI chemistry from one par- 
ticular solvent, ethylene carbonate (EC) (9). 

On the basis of numerous ex situ or in situ/ 
in operando characterizations, the solvation 
sheath structure of Li* has been identified as 
being a key factor in SEI chemistry (9-17). In 
the very first cycle, as the negatively charged 
anode attracts the solvated Li* to its surface, 
it is the solvent molecules in the solvation 
sheath that initially experience reductive de- 
composition, thus becoming the precursors 
of the nascent SEI. EC happens to be the 
solvent that preferentially solvates Li*. It is such 
solvation-interphase correlation that makes EC 
a preferred SEI contributor (9, 12, 13). Although 
the semicarbonate produced by EC decompo- 
sition constitutes the main SEI ingredient, 
recent studies have found that the nascent 
interphase can be electrochemically oxidizable 
(14), whereas long-term cycling turns it into 
monocarbonate (75). 

The capability of EC in strongly solvating Li* 
and in forming a protective interphase on a 
graphitic anode make it almost an indispensable 
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Fig. 1. How electrolytes work. The electrolyte 
must conduct ions, insulate electrons, and remain 
stable while simultaneously interfacing with all 
battery components. Substantial challenges arise 
when the cathode and anode operate at potentials 
beyond the region enclosed between LUMOs and 
HOMOs, where interphases must be formed to 
ensure the reversibility of the cell chemistry. 


electrolyte component in all LIBs. However, the 
prevalent presence of EC in electrolytes also 
causes poor low-temperature performance and 
sluggish charging rates of LIBs because of its 
high melting point (36.5°C), high viscosity 
(2.0 cP at 25°C), and strong coordination with 
Li*. Efforts have been made in recent years 
to replace EC with EC-free electrolytes based 
on acyclic carbonate such as ethylemethyl 
carbonate (melting point -53°C and viscos- 
ity 0.65 cP at 25°C) as the main solvent. When 
used in presence of fluoroethylene carbonate 
as a SEI-forming additive, this EC-free elec- 
trolyte could maintain most of the merits of 
EC-based electrolytes while circumventing 
the disadvantages (6, 17). 

Compared with SEI, there has been much 
less understanding about CEI. In certain cases, 
even the existence of CEI has been debated 
(8). Although studies show that conventional 
electrolytes could support cell operations up to 
5.6 V under certain conditions (19), in cells that 
must withstand long-term cycling, the need for 
a CEI is still fundamentally driven by the mis- 
match of thermodynamic factors (Fig. 1). Un- 
like SEIs, which fully cover an anode, the 
depositions on the cathode surface, mostly 
inorganic fluorides and oxides, have often been 
found to be scattered and discontinuous, raising 


Meng et al., Science 378, eabq3750 (2022) 


suspicion about whether they indeed function as 
protection (20). This was partially because the 
operating potentials of most cathode materials 
used in LIBs (<4.2 V) do not deviate too far 
from the limits allowed by electrolyte HOMOs 
(~4.0 V), but also because CEI chemistry is 
much more complicated. It involves not only 
electrolyte but also the decomposition and 
phase transformation of the cathode itself, 
thus causing the problem of defining where 
the interphase should start and end. However, 
with the development and commercial adop- 
tion of various high-voltage (>4.5 V) cathode 
materials in LIBs, the necessity for CEI is no 
longer questionable, just like SEI (27). 
Researchers have reached a consensus that 
electrolyte and interphase are convoluted, and 
that the solvation structure for Li* (or any 
other cations essential to the cell chemistry, 
such as Na*, Mg”*, Ca?*, and Zn**) is of vital 
importance when designing new electrolyte 
systems (22). The most direct fruition of the 
solvation-interphase correlation is perhaps 
the superconcentration concepts that led to 
diversified new electrolyte systems, including 
water-in-salt electrolytes (WiSEs) (23-25). 


Designing electrolytes and interphases for 
emerging battery chemistries 


In an era where LIB has become a mature tech- 
nology, emerging battery chemistries promising 
either extremely high energy or power or low 
cost and high sustainability are being actively 
pursued. Alternative working ions such as Na* 
(26), K* (27), Mg”* (28), Ca?* (29), and even Al?* 
(30) are also under consideration. To support 
these more energetic and much more aggressive 
chemistries that either operate at extreme po- 
tentials with drastic morphological and volume 
changes (Li° and Si), or involve extremely reac- 
tive surfaces (Li° and air cathodes), or rely on 
working ions difficult to move (Mg**, Ca’*, 
and Zn"), the designing of electrolytes and 
the associated interphases must address dif- 
ferent challenges. 

On the anode side, Si is gradually replacing 
graphite as anode host for Li*, which theoret- 
ically could provide 10 times higher specific 
capacity than graphite. However, the many- 
fold volume change of Si accompanying such 
high capacity presents severe challenges to 
the electrolyte and the associated SEI, which 
must withstand not only the low potential of 
Si (~0.5 V vs. Li) but also the mechanical 
stress and incessant exposure to new Si sur- 
faces (37). Although making Si nanosized or 
embedding it in carbonaceous matrices could 
relieve the stress and fragmentation (32), fluo- 
rinated solvents such as fluoroethylene car- 
bonate and the highly fluorinated salt anion 
fluorosulfonyl imide have also been found to 
stabilize Si effectively, leading to highly fluori- 
nated SEIs containing LiF and organic fluorides 
that help to prevent Si fragmentation (33). 
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Much more severe challenges are associated 
with lithium-metal (Li°) anodes, which are 
being revisited as the ultimate anode mate- 
rial because of their energy density (34). The 
promised high energy of Li° is accompanied 
by extreme reactivity, which is associated with 
excessive and incessant interphase growth, 
leading to both low efficiency of the Li° electrode 
and dangerous morphologies such as dendritic 
and dead Li°. Identifying an electrolyte with 
effective SEI chemistry has become a heavily 
invested research direction, with the goal that 
Li° deposition could proceed homogeneously 
at high Coulombic efficiency (CE%). It is of 
special importance to circumvent fractal-like, 
long-aspect-ratio crystals (Fig. 2), which could 
either penetrate the cell and create an internal 
short circuit or be easily cut off near the roots 
and form highly dangerous dead Li° (35). The 
best electrolytes identified so far are based 
on fluorinated ether molecules rather than 
esters, which can achieve CE% as high as 
99.9% and suppress dangerous Li® morphol- 
ogies during long cycling (36). The improved 
Li® reversibility from such general fluorination 
of both electrolyte composition and interphasial 
chemistry has been attributed to the high sur- 
face energy for Li° to nucleate on the LiF. 

Metals corresponding to multivalent cations 
(Mg”*, Ca?*, Zn?*, and even Al**) are of high 
interest as possible anode materials with the 
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Fig. 2. The challenge of the Li° anode. Designing 
an electrolyte and the associated SEI to eliminate 
dangerous morphologies of dendritic and dead Li? is 
the key to enabling rechargeable lithium-metal 
batteries. Top: an inhomogeneous SEI encourages 
the growth of dendritic Li° at spots that are more 
conductive to Li*, whereas dissolution near the root 
of the dendrite results in dead Li° that is electrically 
isolated from the current collector. Middle and 
bottom: designing better electrolytes and inter- 
phases guides the growth of dense Li° so that 
crystals of low aspect ratios are preferred, which 
means that they stay electrically attached during 
repeated plating and stripping. 
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Fig. 3. Emerging conversion-reaction battery chemistries. Both iron fluorides (top) and sulfur (bottom) 
rely on drastic structure breaking and structure reforming to deliver high capacities and energy densities, but 
heir reversibility encounters severe difficulty because it is almost impossible to completely restore the original 
structure, and the active materials generated could be soluble in electrolytes and leave electrode bulk. 


expectation that multiple electrons involved in 
their reactions would favor the specific capac- 
ity (28~30, 37). However, the Coulombic force 
exerted by these multivalent cations on their 
environments (counterions, solvent molecules 
in bulk electrolyte, polyanions in lattice struc- 
ture, or coordination sites in interphases) is so 
intense that either the multivalent salts have 
limited solubility in the electrolytes or the sol- 
vent molecules are so strongly bound to the 
multivalent ion that there is the possibility of 
their being electrostatically decomposed (38). 
Thus, desolvation at interphases becomes im- 
possible or the multivalent cations are not able 
to travel across the interphases at all (37, 39). 
Electrolyte design must resolve these problems, 
with the aim of forming (i) weakly coordinated 
electrolytes so that the cations are less solvated 
and the desolvation energy barrier could be 
more easily overcome, (ii) reduction-resistant 
electrolytes using various hydrides and organo- 
metallic Grignard reagents so that no SEI forms, 
or (iii) superconcentrated electrolytes in which 
networked anion clusters could assist in liberat- 
ing cations for movement. Unlike LIB electro- 
lytes, ethers rather than esters are exclusively 
favored here because of the intrinsic reactivity 
of the latter with these metals. 

However, regardless of whether they are based 
on hydrides, Grignard reagents, or ethers, these 
reduction-resistant electrolytes raise new issues 
on the cathode side because they are all vul- 
nerable against oxidation. Therefore, it remains 
to be seen whether these electrolytes can with- 
stand the high voltage of multivalent cathode 
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materials, which are still under development 
themselves. Prior experience with LIBs teaches 
us that the best approach to resolving such 
convoluted interests from the anode, cathode, 
and bulk electrolyte is an interphase that can 
effectively decouple them (40). 

Among all multivalent cation chemistries 
that still largely remain remote from practical 
applications, Zn stands out as an exception. As 
the anode material in Alessandro Volta’s first 
battery in 1799, Zn chemistries have powered 
us from the Industrial Revolution into the 
modern era, and they are still being used in the 
alkaline batteries available from drugstores. Zn 
reduction/oxidation potential sits at a moder- 
ate value (-0.76 V versus standard hydrogen 
potential) that allows for aqueous electrolytes; 
however, its poor reversibility has rendered 
these batteries essentially nonrechargeable. 
This situation has witnessed improvements 
in the past 5 years (41), when Zn? reversibility 
has progressed toward practical application, 
aided by advances in understanding Li° anode 
reversibility (42). The missing piece of a re- 
chargeable Zn battery is a viable low-cost 
cathode material that could provide high re- 
versibility and high capacity with Zn. The most 
promising candidates include air cathodes that 
use oxygen from the ambient air (43) and various 
intercalation hosts for Zn?* (44). 

On the cathode side, the pursuit for emerg- 
ing chemistries has been focused on the search 
for either extremely high energy densities (45) 
or materials that are independent from Co 
or Ni, the two elements that LIB chemistries 
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have been heavily reliant on, but are either 
extremely rare in the Earth’s crust or are only 
available at high geopolitical or ethical risk. 

As near-term goals, relatively more mature 
materials, such as improved versions of inter- 
calation compounds, have been explored, some 
of which are already being gradually adopted 
by the LIB industry. Conceptually, they do not 
differ appreciably from the state-of-the-art 
LIB cathode materials, but the structural mod- 
ifications of their lattices allow them to ac- 
commodate more Li* at higher potentials. The 
most popular approach to tailoring electro- 
lytes for these aggressive cathodes are various 
electrolyte additives designed to have higher 
HOMOs so that even at a minor presence (from 
a parts-per-million level to a few percent) in 
the electrolyte, their oxidative decomposition 
could still dominate the CEI chemistry and 
form a better protection for both bulk elec- 
trolyte and the cathode itself (46). In some 
cases, the cathode structure is designed so 
that the redox reaction is no longer confined 
to the transition metals. Instead, the oxygen 
in the lattice could be harnessed for additional 
capacities. The electrolytes for such aggressive 
materials must display special stability against 
the reactive peroxide or superoxide species 
thus produced (47). 

More radical materials leveraging conversion- 
reaction chemistries are being considered as 
long-term targets (48), including metal fluo- 
rides, sulfides (49), elemental sulfur (50), or even 
oxygen (57), which had been unsuccessfully ex- 
plored in the early days of the rechargeable 
battery research. Unlike intercalation-type elec- 
trodes with minimal structural change, these 
electrode materials experience complete re- 
structuring during the discharge, enabling 
significantly higher capacities, but at the ex- 
pense of reversibility because it is almost 
impossible to completely restore the original 
electrode structures (Fig. 3). Because of ad- 
vances made in the synthesis of nanostructured 
materials, these once-inaccessible chemistries 
could be revisited with higher feasibilities. The 
corresponding electrolytes and interphases are 
thus required to resist the strong catalytic ac- 
tivity from the nanosized metallic particles 
generated from the metal fluorides or sulfides, 
suppress the parasitic shuttling of soluble poly- 
sulfides across the cell, and remain inert to the 
reactive peroxide or superoxide intermediates 
from oxygen reduction (48-51). There have 
been limited improvements made thus far in 
addressing these challenges, leaving a wide 
space for researchers to explore. 


New concept electrolytes and interphases 


Substantial efforts are being made not only 
in designing and synthesizing new electrolyte 
components (solvents, salts, and additives), but 
also in making use of already-existing compo- 
nents in an innovative manner. These can be 
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condensed into a few general approaches that 
have been effectively applied to a wide spectrum 
of battery chemistries from Si and Li° to 
multivalent metals and from improved inter- 
calation to conversion-reaction materials. 


Superconcentrating 


Traditionally, electrolytes were confined to a 
salt concentration around 1 molarity (or 1 molal- 
ity) to maximize ion transport. This limit was 
lifted after researchers realized that an extra- 
high salt concentration could bring a series 
of benefits that offset the loss in ion transport 
due to the viscosity increase (52, 53). These 
benefits include nonflammability, anticor- 
rosion at high voltage, high cation transfer- 
ence, and, most importantly, new interphasial 
chemistries. The rationale underneath such 
changes lies in the solvation sheath structures 
of both cation and anion, which are signif- 
icantly altered at these unusual salt concen- 
trations. One extreme example is the WiSE 
(23-25), in which 21 molality or higher lithium 
salts dissolved in water enable the formation 
of an SEI in water and expand the electro- 
chemical stability window of aqueous electro- 
lytes from 1.23 V into the regime of >3.0 V to 
support high-voltage aqueous LIBs. At such 
extra-high salt concentrations, not only are sol- 
vation sheaths restructured, but in extended 
length scales, the hydrogen bonding between 
water molecules completely disappears and an 
anion-cation network forms. The supercon- 
centration concept applies to aqueous, non- 
aqueous, and even hybrid electrolyte systems. 
One particular modification is the so-called 
localized high-concentration electrolytes, in 
which a noncoordinating co-solvent (usually 
polyfluorinated ethers) is introduced to further 
disrupt the liquid structure of the parent electro- 
lyte so that the overall salt concentration would 
be near 1.0 M but locally the cation still sees a 
solvation structure similar to a superconcentra- 
tion regime (54). 


Singularizing 

Because it has been widely established that 
interphases formed in situ on an electrode are 
highly inhomogeneous in both chemistry and 
morphology, efforts are being made to iden- 
tify which chemical is the single most effec- 
tive ingredient in ensuring cell reversibility. 
On the basis of this knowledge, efforts could 
then be made to enrich such chemistry in 
interphases or even make the interphases in 
singular composition. Currently, the attention 
has been focused on two distinct inorganic 
salts, LiF and Li,O, as effective yet manipu- 
lable SEI components. 

The presence of various fluorides, either 
organic or inorganic, has been long identified 
in both SEIs and CEIs, but no simple linear 
relation could be established between fluoride 
content and interphase performance. Fluorine 
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is the most resistive element toward both oxi- 
dation and reduction, so it should make an 
ideal ingredient for interphases designed to 
support aggressive battery chemistries. How- 
ever, fluorides are excellent insulators not only 
to electrons but also to ions, so it is counter- 
intuitive that a pure fluoride-based interphase 
could function at all. Simulations reveal that 
when fluorides interface with other impurities 
such as oxides or carbonates at nanometric 
scale, highly conductive pathways for Li* can 
be created along the grain boundaries (55). It 
becomes increasingly apparent that fluoride 
distribution in the interphases and interac- 
tions with other species are more critical than 
the fluoride content alone (56). Thus far, 
knowledge about precisely tailoring the nano- 
metric morphology of these fluoride species is 
still missing. A semi-empirical approach adop- 
ted as the design philosophy is that fluorine 
must be prestored in the structures of various 
solvents, additives, or anions, and would then 
become available only when electrochemically 
activated. Fluorides formed in this manner 
would exist in nanometric scales and well 
interfaced with other interphasial ingredients. 
Highly fluorinated interphases have been made 
with impressive performances, examples of 
which include the almost pure LiF- or NaF- 
based SEIs formed in WiSE from TFSI anions 
and the LiF-enriched CEIs formed in fluori- 
nated ether electrolytes (23, 24). 

Li,O was identified recently as an alterna- 
tive interphasial ingredient on Li° on the basis 
of observation and analysis achieved under 
cryogenic electron microscopy (57, 58). Unex- 
pectedly, highly fluorinated ether electrolytes 
that have led to fluorinated CEI on a NMC 
cathode simultaneously generated Li,O-based, 
monolithic SEIs on a Li° surface without any 
detectable fluorides (58, 59). More pure inter- 
phases of singular chemical composition were 
thus attempted, in which both LiF and Li,O 
were artificially applied on Li° surfaces using 
a suspension of corresponding nanoparticles 
(60). It is anticipated that with increasing 
understanding about interphasial chemistry 
and morphology, we will witness further such 
efforts with new chemical ingredients. How- 
ever, caution should be exercised here because 
we actually know very little about how LiF or 
Li,O functions in an interphase. The corre- 
lation between their presence in interphases 
and their effect on electrochemical performance 
remains superficial and highly empirical. More 
fundamental knowledge is needed before we 
can determine whether an ideal interphase 
should be singular or heterogeneous. 


Solidifying 

Making electrolytes solid has always been de- 
sirable because solid electrolytes (SEs) provide 
nonflammability, zero leakage, and mechanical 
resistance against Li°-dendrite growth, thus 
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offering more convincing feasibility of enabl- 
ing Li° anodes in wide range of operating 
temperatures (67). Decades ago, researchers 
started seeking suitable materials with an 
excellent electrolyte nature, i.e., high ionic 
conductivity (o;;, > 0.1 mS/cm) but low electro- 
nic conductivity (0. < 10-” mS/cm). These efforts 
have covered organic materials such as poly- 
meric, inorganic ceramic or glassy materials 
such as oxides, halides, and sulfides as well as 
composites that combine all of them (61, 62). 

Polymeric ion conductors are of interest be- 
cause of their flexibility and processibility, but 
the coupling between the segmental motion 
of the oligoether chain and the ion transport 
constitutes an insurmountable barrier to achiev- 
ing sufficient ion conductivities at ambient 
temperature (63). The average conductivity of 
~10° S/em at 25°C restricts the use of these 
polymer electrolytes in batteries to niche ap- 
plications at elevated temperatures, such as 
the solid lithium-metal batteries adopted in 
certain electric vehicles in Europe. Fundamen- 
tal breakthroughs are needed to decouple ion 
transport from polymeric segmental relaxation, 
which requires a new ion transport mechanism. 
One recent advance in which cellulose chains 
are artificially expanded by bridging copper 
ions so that Li* can travel independently from 
the polymeric chain represents one such di- 
rection (64). 

Among the inorganic solid materials, sul- 
fides have emerged as a top contender, with 
high ionic conductivities at room temperature 
(>10? S/em) and high compliance to be pro- 
cessed into ultrathin dense layers, but their 
moderate electrochemical stability requires 
extra interphasial protection against extreme 
potentials of numerous cathode and anode 
chemistries (67). Argyrodite LigPS;Cl (LPSCI) 
has been revealed to form interphases with both 
Li° and Si, which is dominated by electronic 
insulating yet ionic conductive salts such as 
LiCl, Li,S, and LizP (65). For high-voltage 
cathodes such as layered transition-metal 
oxides, sulfides are usually protected by arti- 
ficial interphases that are formed by coating 
on sulfide SEs a layer of ceramic with high 
oxidative stability, examples of which are 
lithium niobates and lithium borates. With 
tailored composition and interfacial engineer- 
ing, sulfide-based SEs could work at voltages 
>4.2 V (66). 

Various types of scalable, low-cost, uniform 
coating strategies on cathode materials are 
still being explored, because the mismatch in 
chemical and mechanical properties between 
the cathode active materials and the SE could 
lead to uncontrolled impedance growth that 
negatively affects cell performance (67). Coating 
layers not only chemically and electrochemically 
protects the interfaces from decomposition, but 
also serves as a mechanical buffer to stabilize 
solid-solid interfaces. The formation of dendritic 
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Fig. 4. Interfacing solid with solid. Interfacing 
solid materials in an all-solid battery encounters 
both challenges and opportunities with the solid- 
solid contact points of active anode/cathode 
materials and the SEs. 


and dead Li® continues to plague Li metal 
anodes because of imperfections and inho- 
mogeneity at metal-SE interface, leading to 
these dangerous Li° morphologies that become 
exacerbated at higher current densities. Re- 
cently, the inhomogeneous stripping of lithium 
and the subsequent formation of voids have also 
become a focus of research (68). Plastic defor- 
mation of the metal through mechanical com- 
pression shows promise in preventing dendritic 
and dead Li° (69, 70). However, obtaining these 
pressures may require significant cell-level pen- 
alties because of the need for special fixturing. 

The paramount challenge for SEs still comes 
from solid-solid interfaces, because SEs cannot 
flow or infiltrate into the porosity structure 
of the electrodes. A common approach to 
overcoming the physical contact issue is the 
addition of a controlled amount of liquid 
electrolytes to fill the voids and gaps. This 
renders the system less solid and results in a 
hybrid electrolyte or semi-SE (77). Conversely, 
the solid-solid contact challenge could also be 
turned into an advantage to enable three- 
dimensional (3D) architectures for the Li or Si 
anodes, because the interphasial reactions can 
only occur at those contact points, whereas 
large volume expansion and shrinkage can be 
accommodated with appropriate 3D design 
and dynamic pressure control (Fig. 4) (72). The 
single-ion conducting nature of SEs eliminates 
concentration polarization and allows ac- 
commodation of local effects that cause non- 
uniform reaction and eventual capacity fade 
(73). Although it remains a key hurdle for 
the battery field to quantify the coupled 
electrochemical-mechanical phenomena, recent 
work has been rather encouraging, showing 
a stable and long cycle-life of laboratory-scale 
batteries. 
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Fig. 5. Solvation in nanoconfinement. When the host environment becomes comparable in size with the 
ion solvation sheaths, forced interaction between the host and ion or its solvent members becomes inevitable, 


which induces partial or complete desolvation of the io 


Liquefying 

In addition to liquids and solids, a new frontier 
has been created in which a quasi-gases are 
harnessed as electrolyte solvents. These efforts 
enable operation at lower temperatures not 
otherwise possible with typical electrolytes. 
Relatively polar gaseous hydrofluorocarbon 
molecules such as fluoromethane (CH3F) can 
be liquefied under moderate pressure (74), 
which dissolves salts at room temperature to 
form liquefied gas electrolytes. These electro- 
lytes are nontoxic and have relatively strong 
covalent bonds that allow for a wide electro- 
chemical window. Because of their small mole- 
cular size and weak intermolecular attraction, 
these solvents display exceptionally low viscosity 
that enables a superior solvent dielectric-fluidity 
factor that is higher than that of typical electro- 
lyte solvents such as esters and ethers. The 
solvents also have extremely low melting points 
that render batteries operational below -60°C. 
Additives such as tetrahydrofuran (THF) (75) 
can be introduced to alter the Li*-solvation 
structure, which has been fully populated with 
CH3F molecules. The intrinsic fluorine-rich 
chemistry and the self-equilibrated pressure 
enable Li® reversibility even at very high cur- 
rent densities (10 mA/cm?) and low temper- 
atures. Other types of co-solvent could also 
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be used to widen the operation temperature 
range, especially at the upper limit, where 
salt precipitates and ion transport shutdown 
occurs. More recent use of difluoromethane 
(CH,F,) in combination with dimethoxyethane 
co-solvent have shown such an improvement 
(76), and the concept was also extended to 
multivalent chemistries such as Zn (77). These 
unconventional electrolytes, although confined 
to limited applications because they require a 
hermetical stainless steel case, have significant 
potential for further development. 


Nano-contining 


The size of most ion solvation sheaths ranges 
from an angstrom to nearly a nanometer, de- 
pending on the solvent molecules and the 
solvation numbers. With the emergence of 
diversified nanomaterials, an interesting ques- 
tion arises: How would the solvated ions be- 
have in these sub-nano structures? There have 
been numerous reports on the unusual prop- 
erties of electrolytes in nano-confinement. For 
example, anomalous capacitance arose when 
solvated tetraalkyl ammonium cations were 
forced into pores as small as 0.65 nm (78). 
Significantly different interphasial behaviors 
and morphologies of Li® anode emerged 
when infusing conventional LIB electrolytes 
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modeling bring a new horizon to the understanding and discovery of electrolytes and their interphases. 


into nanopores of a ceramic-polymer composite 
host of mean diameter ~40 nm (79). The electro- 
chemical stability window was significantly ex- 
panded when ether-based electrolytes were 
frozen into the sub-nano channels of 0.29 nm 
created by metal-organic framework struc- 
tures (80, 8/7). Finally, extraordinarily fast ion 
transport far above the corresponding bulk 
polymer electrolyte occured along the ceramic- 
polymer interfaces when the same polymer 
was embedded into the channels of a ceramic 
host as small as 40 nm in diameter (82). Al- 
though these scattered phenomena do not seem 
to be related at first glimpse, and the inves- 
tigators in each study offered widely different 
rationales, there is a converging factor in all of 
them: In the nanostructured environment, the 
solvated ions were likely to interact with the 
surfaces of the host materials (Fig. 5), during 
which partial or even complete desolvation of 
the ions occured, producing electrolytes at 
chemical states that are not fully understood 
yet. Therefore, there might be opportunities 
to leverage these conditions. 


Outlook: A bright future with apprehensions 


A battery is a multicomponent system consist- 
ing of at minimum a positive electrode (cathode), 
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a negative electrode (anode), and an electro- 
lyte. For the battery chemistry to work, all of 
these components must synchronize with each 
other. The failure of a battery often occurs not 
as a result of a single malfunctioning compo- 
nent but rather because of the interactions 
between different components. 

Constrained by such a synchronization re- 
quirement, the electrolyte has often been the 
languishing component (8) because it must 
simultaneously interface with every other com- 
ponent in a battery, be it active components 
such as the cathode and anode or inactive 
components such as separators, current collec- 
tors, and cell packaging. A significant amount 
of knowledge has been generated about electro- 
lytes and their relation to interphase. Neverthe- 
less, many basic questions remain unanswered, 
such as: (i) How exactly do ions travel across 
interphases consisting of intrinsically insulating 
materials? (ii) How can one precisely measure 
electronic and ion conductivity within the 
interphases? (iii) Which components are ef- 
fective and functional in the interphases and 
which are unnecessary and redundant? This 
list of questions will surely grow as new systems 
are developed and explored. In the meantime, 
new insights are being provided by various 
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situ and in situ/in operando characterizations assisted by computational simulation and 


analytical methods that were previously un- 
available. For example, combined chemical 
analysis, cryogenic electron microscopy, and 
pair distribution function (PDF) acquired from 
a synchrotron revealed that the interphasial 
chemistry on Li° is much more complicated 
than was once believed, with the exotic inter- 
phasial components such as glassy Li? and 
lithium hydride still under debate (83, 84). 
To answer all of these questions, character- 
ization techniques, especially those operational 
under in situ/in operando conditions and those 
with extremely high spatial and temporal re- 
solutions, are needed. In the past decades, 
although many advanced characterization tools 
have been developed to probe and quantify the 
interphasial species, tools that directly probe 
the electrolyte are still absent. Given the non- 
crystalline and fast ion dynamics nature of 
typical electrolytes, spectroscopic methods with 
high chemical sensitivity and total scattering 
for local structure are most suitable because 
they are tolerant to long-range disordered ar- 
rangements. As shown in Fig. 6, a comprehen- 
sive understanding about solvation structure in 
electrolytes, the associated interphase chemis- 
try, and its distribution along 2D and 3D length 
scales, as well as how the chemistry evolves 
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during electrochemical processes, could be 
achieved by the study of large datasets col- 
lected using a combination of a host of tech- 
niques. These include liquid and solid nuclear 
magnetic resonance (13), differential PDF de- 
rived from x-ray or neutron scattering (85), 
and cryogenic analytical electron microscopy 
equipped with tomography and electron energy 
loss spectroscopy (86). 

As highly powerful supercomputing capa- 
bilities become more accessible, complicated 
processes in electrolyte bulk and interfaces 
are increasingly available for modeling with 
improved accuracy. Computational methods 
increasingly play a role in the discovery of new 
electrolytes and additives by using density 
functional theory to screen the thermodynamic 
oxidative and reductive stability of various 
organic molecules (87), in predicting ion mobi- 
lity in SEs using methods such as nudged 
elastic band coupled with ab initio molecular 
dynamics (88), as well as in understanding 
interfacial and interphasial processes (89, 90). 
These approaches have led to the development 
of infrastructures and associated databases 
to provide electrolyte information to the wider 
research community (97). With the increased 
interest in automated high-throughput exper- 
imentation (so-called robotic labs), these com- 
putation screening approaches are now aided 
by large volumes of experimental data, lending 
themselves to analysis using machine learning 
approaches (92). 

Computation predictions complement expe- 
rimental characterization methods by offering 
insight into the detailed and transient pro- 
cesses at interfaces. The recent molecular dy- 
namic simulations of WiSE and its derivatives 
on electrified surfaces have revealed the pos- 
sibility of predicting interphasial chemistries 
based on how the solvents and ions assemble 
and position in the inner-Helmholtz layers of 
the electrode before interphase formation (89). 
Further, molecular dynamic simulations also 
shed light on the reduction reactions responsi- 
ble for interphase formation producing solid 
and gaseous products including C,H,, LigCOs, 
LiF, and LEDC (93). 

Finally, the increasing interest in developing 
electrolytes with enhanced transport properties 
has shed light on scale-bridging approaches 
linking ion correlations at the atomic scale to 
transport properties at the continuum scale 
using Stefan-Maxwell formalism (94). These 
methods have recently incorporated additional 
phenomena such as solvent motion (95) and its 
role in the precise definition of ion transference 
and the role of morphological effects at the 
nanoscale on conductivity (96), both of which 
use large-scale computations. 

Although computational methods have 
blossomed over the past decade, further efforts 
are needed to move beyond thermodynamic 
calculations into the prediction of chemical 
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reactions so that the cascade of reactions 
leading to the formation of interphases can 
be precisely described and tighter coupling 
between experimental methods and predic- 
tions can be forged. Artificial intelligence and 
machine learning methods will eventually grow 
to a leading role in these endeavors. Validation 
of modeling requires further support from ad- 
vanced characterization. 

The past decade has witnessed the explosive 
growth of research activities on battery chem- 
istry and materials, with electrolyte develop- 
ment taking a substantial role. This emphasis 
on an area of vital importance to the future 
of our planet and civilization will be sustained 
and driven further by anxieties over both geo- 
political and supply chain risks. Given that a 
battery is a system consisting of multiple com- 
ponents, complications often arise when evaluat- 
ing a single component in different environments 
and settings. To enforce the reproducibility and 
comparability of the battery performance data 
in the literature, rigorous practice and stand- 
ardized protocols by researchers are strongly 
encouraged. Some efforts have already been 
made in this direction (97, 98), and we view 
these as indispensable guidelines that the bat- 
tery, chemistry, and materials research com- 
munity should follow strictly. 
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INTRODUCTION: The intracellular concen- 
tration of a protein depends on the rates of 
several processes, including transcription, 
translation, and the degradation and/or 
dilution of messenger RNAs (mRNAs) and 
proteins. These rates can be vastly differ- 
ent for different genes and across different 
growth conditions because of gene-specific reg- 
ulation. At the systems level, protein concen- 
trations are further affected by the availability 
of shared gene expression machineries—e.g., 
RNA polymerases and ribosomes—and are 
constrained by the approximately invari- 
ant cellular mass density. Even in one of 
the best-characterized model organisms, 
Escherichia coli, it is unclear how the gene- 
specific and systems-level effects work to- 
gether toward setting the cellular proteome. 
This knowledge gap has not only hindered 
our efforts in building a predictive frame- 
work of gene expression but has also lim- 
ited our abilities in guiding the rational design 
of gene circuits. 


RATIONALE: We undertook a quantitative, 
genome-scale study, combining experimental 
and theoretical approaches, to tease apart the 
contribution of the specific and global effects 
on cellular protein concentrations in expo- 
nentially growing E. coli cells across a variety 
of growth conditions. We complemented 
genome-scale proteomic and transcriptomic 
data with biochemical measurements of total 
absolute mRNA abundances and synthesis 
rates. We compared these measurements to 
gene dosage and the concentrations of ribo- 
somes and RNA polymerases to quantitatively 
characterize the activity of the gene expression 
machinery across conditions. This comprehen- 
sive dataset allowed us to analyze, in quanti- 
tative detail, the interplay between the activity 
of gene expression machinery, the activity of 
individual promoters, and the resulting pro- 
tein concentrations. 


RESULTS: We compiled a comprehensive atlas 
of the determinants of gene expression across 


Ribosomes 


Coordination of 
transcriptional and 
translational activities 


Similar mRNA characteristics 
across most genes 


Principles governing gene expression in E. coli. The availability of RNA polymerases (RNAPs) is 
coordinated with that of ribosomes, and the mRNA characteristics (translation efficiency and degradation 
rates) are uniform across most genes and growth conditions. These two principles prescribe a simple 
gene expression strategy in which protein concentrations are almost entirely controlled at the promoter level 


for most genes. 
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conditions—from the concentrations of genes, 
mRNAs, and proteins to the rates of transcrip- 
tional and translational initiation and mRNA 
degradation for thousands of genes. We were 
able to determine the on rate of each pro- 
moter, a quantity capturing the overall effect 
of transcriptional regulation that has been 
elusive through most existing gene expres- 
sion studies. Unexpectedly, we found that for 
most genes, the cytosolic protein concentra- 
tions were primarily determined by the innate 
magnitude of their promoter on rates, which 
spanned more than three orders of magnitude. 
Changes in protein concentrations resulting 
from changes in growth conditions were typ- 
ically much smaller—well within one order of 
magnitude—and were mostly exerted through 
changes in transcription initiation. 

E. coli’s strategy to implement gene reg- 
ulation can be summarized by two design 
principles. First, protein concentrations are 
predominantly set transcriptionally, with rela- 
tively invariant posttranscriptional character- 
istics (translation efficiencies and degradation 
rates) for most mRNAs and growth conditions. 
Second, the overall fluxes of transcription 
and translation are tightly coordinated: The 
average density of five ribosomes per kilo- 
base is nearly invariant across mRNA species 
and across growth conditions, even though 
the mRNA and ribosome abundances can 
each vary substantially. We find this coor- 
dination to be implemented through the 
anti-sigma factor Rsd, which modulates the 
availability of RNA polymerases for transcrip- 
tion across different growth conditions. These 
two principles lead to a quantitative formu- 
lation of the central dogma of bacterial gene 
expression, connecting mRNA and protein con- 
centrations to the regulatory activities of the 
corresponding promoters. 


CONCLUSION: These quantitative relationships 
reveal the unexpectedly simple strategies used 
by £. coli to attain desired protein concentrations 
despite the complexity of global physiological 
constraints: Individual protein concentrations 
are primarily set by gene-specific transcrip- 
tional regulation, with global transcriptional 
regulation set to cancel the strong growth rate 
dependence of protein synthesis. These rela- 
tions provide the basis for understanding the 
behavior of more complex genetic circuits in 
different conditions and for the inverse prob- 
lem of deducing regulatory activities given the 
observed mRNA and protein levels. 
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Protein concentrations are set by a complex interplay between gene-specific regulatory processes 

and systemic factors, including cell volume and shared gene expression machineries. Elucidating 

this interplay is crucial for discerning and designing gene regulatory systems. We quantitatively 
characterized gene-specific and systemic factors that affect transcription and translation genome-wide 
for Escherichia coli across many conditions. The results revealed two design principles that make 
regulation of gene expression insulated from concentrations of shared machineries: RNA polymerase 
activity is fine-tuned to match translational output, and translational characteristics are similar 

across most messenger RNAs (mRNAs). Consequently, in bacteria, protein concentration is set primarily 
at the promoter level. A simple mathematical formula relates promoter activities and protein 
concentrations across growth conditions, enabling quantitative inference of gene regulation from 


omics data. 


ene expression involves transcription of 

genes into mRNAs followed by trans- 

lation of mRNAs into proteins. Protein 

concentrations are in turn determined 

by the balance between protein synthe- 
sis and dilution (Fig. 1A and fig. S1) for ex- 
ponentially growing bacteria in which protein 
degradation is negligible (7, 2). Intuitively, 
doubling the transcription initiation rate of 
a specific gene by an activator should result 
in doubling the concentration of the corre- 
sponding mRNA and protein in the absence 
of posttranscriptional regulation (3-5). How- 
ever, if the transcription initiation rate of every 
gene were doubled, say by a global activator, 
protein concentrations could not possibly dou- 
ble: First, the synthesis of proteins is constrained 
by the translational capacity of the ribosomes 
(6, 7). Second, in the well-characterized model 
bacterium FE. coli, the total number of pro- 
teins per cell volume does not vary much across 
different growth conditions (fig. S2) (8, 9), 
which makes it impossible to change the con- 
centrations of most proteins in the same di- 
rection even if there is no constrain in gene 
expression capacity. Thus, the canonical single- 
gene picture of bacterial gene expression is 
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not necessarily compatible with global con- 
straints during broad changes in gene ex- 
pression, as they occur, for example, upon 
changes in nutrient conditions or exposure 
to antibiotics. These effects make it difficult 
to link the transcriptional and translational 
regulation to the concentrations of mRNAs 
and proteins. 

As the expression of each gene is ultimate- 
ly determined by the rates at which the re- 
spective mRNAs and proteins are synthesized 
and diluted (fig. S1), we designed a battery 
of experiments to determine these rates by 
measuring the absolute mRNA and protein 
concentrations and their fluxes for E. coli 
growing exponentially under various condi- 
tions. Our findings establish characteris- 
tics of promoters and mRNAs that defied 
our expectations and reveal design principles 
underlying E. coli’s gene regulation program, 
which enable the cell to allocate its proteome 
in accordance to functional needs while com- 
plying with cellular constraints. We established 
a simple, quantitative relation that connects 
gene regulatory activities to mRNA and pro- 
tein concentrations. 


Results 
Translation initiation rates are similar across 
mRNAs and growth conditions 


A proteomics workflow was developed to 
quantify the abundance of individual E. coli 
proteins (10) by combining the versatility of 
data-independent acquisition (DIA) mass spec- 
trometry (//, 12) and the accuracy of ribosome 
profiling (73). We determined the protein num- 
ber fractions y, ; = [P;]/[P] for >1900 proteins 
(labeled by 7), with [P] = >> [Pil being the 
total protein concentration, defined here as 
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the number of proteins per cell volume. RNA 
sequencing (RNA-seq) was used to determine 
the mRNA number fractions y,,, ; = [mR] /[mR] 
for the corresponding mRNAs, with [mR] = 
So [mei] being the total mRNA concentra- 
tion (see materials and methods). In both 
cases, our data showed high reproducibility 
(fig. S3, A to C). The result for EF. coli K-12 cells 
growing exponentially in glucose minimal me- 
dium is shown as a scatter plot of y, ; versus 
Wm; in Fig. 1B. We observed a strong corre- 
lation (r = 0.80) of mRNA and protein abun- 
dances along the diagonal (red line) across a 
vast range of abundances (1077 to 10-°). The 
histogram of wy, ;/Wm; is peaked around 1, 
with 50% of the genes within 1.7-fold (fig. S3E). 
We repeated the measurements for cells grow- 
ing exponentially in three types of growth- 
limiting conditions in minimal medium (carbon 
limitation, anabolic limitation, and transla- 
tional inhibition) (74, 75), with growth rates 
ranging between 0.3 hour” and 0.9 hour. 
A similar number of gene products were de- 
tected in these conditions, and the resulting 
scatter plots and histograms (fig. S3, F to S) 
looked very similar to those from bacteria 
grown in glucose minimal medium (fig. $3, 
Dand E). These results, summarized in Fig. 1, 
C and D, indicate that the fractional abun- 
dances of mRNA and proteins are approx- 
imately the same. That is 


Vani & Woa (1) 


for most expressed genes in all growth con- 
ditions tested. The strong correlation between 
mRNA and protein fractions is also supported 
but less emphasized in several recent quanti- 
tative studies of FE. coli protein expression 
(13, 16-18) (fig. $3, T to W). 

To probe how changes in protein and mRNA 
fractions relate to each other across growth 
conditions, we generated additional proteomics 
and transcriptomics datasets for more condi- 
tions under each type of growth limitation (fig. 
S4A) so that a smooth growth rate dependence 
could be obtained individually for the mRNA 
and protein fractions. We extrapolated these 
data to compute the fold change (FC) in the 
protein and mRNA fractions—FC(y, ;) and 
FC(y,,;), respectively—for each gene i (fig. 
S4B). The FC was calculated between the 
reference condition (wild-type cells grown 
in glucose minimal medium) and one with 
~3x slower growth for each of the three types 
of growth limitation imposed. Their ratio, 
FC(w,,;) /FC(Wm.i), Was even more tightly dis- 
tributed than y, ;/Wm, for each type of growth 
limitation (compare Fig. 1E with Fig. 1D), which 
indicated that the mRNA and protein frac- 
tions tightly covaried for most genes. The few 
exceptions that did not covary usually oc- 
curred in only one of the growth limitations 
and mostly corresponded to known targets of 
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of the basic processes determining mRNA and protein concentrations i 
exponentially growing bacteria. The rate of each process can potentially vary 


across both genes and conditions; the symbols used throughout th 
described alongside the respective cellular processes (fig. S1). (B) 
K-12 strain NCM3722 growing exponentially in glucose minimal mediu 


e st 


condition; growth rate, 0.91 hour™), the fractional number abundances of 


(Wp, Obtained from DIA/SWATH mass spectrometry) (10) and of mR 


As 


obtained from RNA-seq; see materials and methods) for each gene i are sh 
scatter plot (number of genes and Pearson correlation coefficient r in the figure). The 
red line represents the diagonal, pj = Wm. (C) The ratios of protein and mRNA 


n 


udy are 


For E. coli 
m (reference 


proteins 


(Wimir 
jown as a 


(materials and methods). The boxes and whiskers represent 50 and 90% of t 


ne 


genes, respectively; x-axis values give the corresponding growth rates. See tables 


S1 and S2 for lists of strains an 
for transcriptomics and proteo 
obtained in reference condition 


mics data. (D) Distributions of the ratios w, ;/ 


d conditions in this study and tables S3 and S4 


my 


and the slowest growing of each of the three types 


of limitations; same color code as in (C). The same plots also give the distributions 


of the relative translational initi 


ation rate, op; /Gp (see the text). (E) The FCs 


in 


protein and mRNA fractions for each gene i between the reference condition and 
the slowest growth condition, FC(y,,) and FC(y,,;), were computed as descr 


in fig. S4 for each one of the th 


ibed 
ee growth limitations; the distribution of their ratio 


fractions, Wp ;/Wm,, are distributed around 1 for exponentially growing cultures 
under all growth conditions studied (fig. S3, E to S). These include the reference 
condition (black) as well as conditions of reduced growth achieved by limiting 


posttranscriptional regulation (fig. S4, C to E, 
and table S5). 


Total mRNA abundance matches 
translational capacity 


From the steady-state relation between con- 
centrations of individual mRNAs and proteins 
(fig. S1) 

Op i[MRi] = ALP] (2) 
where a; is the translation initiation rate of 


each mRNA mR; and A denotes the growth 
rate, we can sum over contributions from all 
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genes to obtain a relation between the flux of 
total protein synthesis and dilution 


(3) 


with op = pts being the average trans- 
lational initiation rate (over all mRNAs). Be- 
cause the ratio of Eq. 2 and Eq. 3 gives 

Op i/Ap = Wpi/Vm.i (4) 
we see that Fig. 1D also provides the distri- 


bution of the relative translation initiation 
rates (Op; /Op). The observed similarity be- 
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FC(ypi)/FC(wm,) is shown using the same color code as in (C). The histograms 
are narrowly distributed around 1, with more than half of the genes within 35% 
from the median. See table S5 for the FCs in translation efficiency for each gene. 


tween mRNA and protein fractions (Eq. 1) 
implies that the translation initiation rates 
Op,; are similar for most mRNAs for each 
growth condition. Thus, the average transla- 
tional initiation rate G, can be taken as rep- 
resentative most mRNAs. 

Because the total protein concentration [P] 
changes <10% across nutrient-limited condi- 
tions (fig. S2H), the total protein synthesis flux 
ALP] changes almost linearly with the growth 
rate 2. Consequently, Eq. 3 constrains the total 
mRNA concentration [mR] and/or the aver- 
age initiation rate a) to change with cellular 
growth. To understand how this constraint 
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is accommodated, we quantified the total 
mRNA amount by combining hybridization 
of ?H-uracil-labeled RNA to genomic DNA 
with quantitative Northern blotting (fig. S5 
and materials and methods). The result for 
carbon-limited growth (fig. S5H) was then 
converted to cellular concentration, i.e., [mR] 
(supplementary text, note S1), and is shown 
as the red symbols in Fig. 2A (left vertical axis). 
These data allowed us to use Eq. 3 to obtain 
the average translation initiation rate a). The 
approximately linear growth rate dependence 
of [mR] makes dp only weakly growth rate 
dependent (fig. S6A, left axis). The value of 
@p in turn enabled us to obtain the distribu- 
tion of a, ;, the translational initiation rate 
of individual mRNAs, using Eq. 4 and the 
distributions of y, ;/Wm; (Fig. ID). The results 
for the reference and a slow, carbon-limiting 
growth condition exhibited weak dependence 
of dp; on both the mRNA species and growth 
condition (Fig. 2B and fig. S6B). 


Fig. 2. Coordination of mRNA and ribosome 
abundances. (A) (Left axis; red symbols) 
Total concentration of mRNA is plotted against 
the growth rate. Total mRNA abundance and 
associated standard deviations are based 

on three measurements obtained as described 
in fig. S5 and the materials and methods. 

The measurements were performed for a range 
of growth conditions, including reference, 
glucose uptake titration (Pu-ptsG; table S1), 
and a host of poor-carbon sources. (Right axis; 
gray symbols) Concentration of active 
ribosomes in nutrient-limited conditions, 
converted from the data in (7) (reported per 
culture volume) using the total cellular 
volume shown in fig. S2, C to E. (B) Translation 


initiation rates, ap; = Gp (Voi/¥mi). in 
eference (black) and carbon-limited (red) 
growth. (C) The spacing between consecutive 
translating ribosomes on an mRNA is given 

by the ratio between the ribosome elongation 
ate [similar across mRNAs; fig. S6 and (7)] 
and the translation initiation rate o;, which 

is also narrowly distributed [see (B)]. Our 
data give an average ribosome spacing of 

d = 200 nt (fig. S6D). RBS, ribosomal binding 
site. (D) Absolute mRNA and protein concen- 
tration for each gene in the reference condi- 
tion, computed by combining the fractional 
abundances yj; and yp); with total mRNA 
abundances (A), total protein abundances, and 
cell volume (fig. S2 and supplementary text, 
note S1). Blue lines indicate the corresponding 
values of interribosome spacing d, calculated 
from the known elongation rates (~15.3 aa s”). 
(E) Same as (D), but for slow growth in the 
most carbon-limiting condition [growth rate, 
~0.35 hour; elongation rate, ~12.4 aa s (7)]. 


protein concentration, 
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mRNA concentration, 


Constancy of ribosome spacing across mRNA 
and nutrient conditions 
To understand how the relation between the 
total mRNA concentration and the total pro- 
tein synthesis flux (Eq. 3) arises, we note that 
the total flux of peptide synthesis is given by 
MP]ép, where @p is the average length of a 
protein—~250 amino acids (aa) across con- 
ditions (fig. S2G). Because genome-wide trans- 
lation elongation rates are narrowly distributed 
(fig. S6C), this flux corresponds to the pro- 
duct of the concentration of actively trans- 
lating ribosomes ([Rd],,) and the speed of 
translational elongation (€) as depicted in fig. 
S6D. That is 
Pep = €[RO] act (5) 

Both [Rb],-t and e have been characterized 
for a broad range of nutrient conditions (7). 
The active ribosome concentration is plotted 
in Fig. 2A (right vertical axis). The data ex- 
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[Rb]act (10°/um?) 


hibit a congruence with the total mRNA con- 
centration, revealing a coordination of mRNA 
abundance and the cellular translational ca- 
pacity, with an average spacing between trans- 
lating ribosomes close to 200 nucleotides (nt) 
(Fig. 2C and fig. S6E). This translates to an 
average of number of ribosomes per mRNA 
r = 4 for typical mRNAs 750 nt long. The ob- 
served proportionality between total mRNA 
and active ribosome concentrations implies 
that the average translational initiation rates 
Op and elongation rates ¢ are proportional (fig. 
S6A); this proportionality extends to individual 
mRNAs because of the similarity in protein 
and mRNA fractions observed above. By com- 
paring the concentrations of protein and 
mRNAs (fig. S6, F and G), we confirmed that 
for most mRNA species, the ribosome spacing 
is clustered around 200 nt across conditions 
(Fig. 2, D and E, and fig. S6, H and I). The data 
are bounded by ~40 nt per ribosome, in accord 
with the physical packing limit (79). 
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mRNA degradation is largely 

condition independent 

We investigated the mechanism behind the 
observed proportionality between the concen- 
trations of total mRNA and active ribosomes 
(Fig. 2A). mRNA concentration is set by the 
balance between its synthesis and degradation 
(fig. S1). We performed kinetic experiments to 
determine mRNA degradation rates 5; genome- 
wide in the reference and the slowest carbon- 
limiting condition by inhibiting transcription 
initiation with rifampicin and quantifying the 
relative mRNA concentrations at short time 
intervals by RNA-seq (fig. S7, A to C; materials 
and methods; and table S3). As an example, 
we measured time courses of changes in the 
relative mRNA concentrations of genes of the 
nuo operon in the two growth conditions (Fig. 3, 
A to C). The time course can be described as a 
delayed exponential decay, with the lag time 
reflecting the time needed for the RNA poly- 
merase (RNAP) to reach the gene from the 
transcription start (fig. S7D) and the decay 
rate attributed to the turnover of that mRNA. 
This analysis yielded degradation rates for 
~2700 mRNAs (materials and methods and 
table S6). Genome-wide, mRNA degradation 
rates were strongly correlated in the two 
growth conditions (fig. S7E). The average deg- 
radation rate was very similar (fig. S7F, vertical 
dashed lines), even after weighting by mRNA 
abundances (fig. S7G). In particular, the FC 
in 6; is sharply peaked, with 90% of genes 
in the range of 0.50 to 1.57 (Fig. 3D). Overall, 
the data indicated a lack of dependence of 
degradation rates on either the mRNA spe- 
cies or the growth condition for most mRNAs, 
as observed in other studies (20, 27). The cases 
where rates changed significantly could be 
associated to known posttranscriptional regu- 
lation (fig. S7, H and I). 


Total mRNA synthesis flux is adjusted to match 
translational capacity 


From the concentration and degradation 
rates of each MRNA species—[mR,;] and 8;, 
respectively—we can obtain the mRNA degra- 
dation flux, 5,[7mR,], whose distributions are 
shown in Fig. 3E for the reference and slow 
growth conditions. A shift in the median of 
the two distributions is seen (vertical dashed 
lines), reflecting growth dependence of the 
total degradation flux, Yai [mR;] = 5[mRI, 
where 6= So bY, is the average degrada- 
tion rate across mRNAs. By the balance of 
mRNA synthesis and degradation in steady- 
state growth (fig. S1 and supplementary text, 
note S3), the total mRNA synthesis flux Jing 
can be expressed as 


Jnr => om [Gi] = 6 x [mR] (6) 


Because the average degradation rate (8) is 
affected little by growth conditions (fig. S7G), 
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Eq. 6 predicts that the observed growth depen- 
dence of the total mRNA concentration [mR] 
(Fig. 2A) is caused primarily by changes in 
mRNA synthesis flux Jnr. 

We tested this prediction by directly mea- 
suring the total mRNA synthesis flux J, across 
the range of carbon-limited growth conditions, 
by pulse-labeling cultures with *H-uracil and 
hybridizing the labeled RNA to genomic DNA 
over short time intervals (fig. S8). These data 
showed a strong growth rate dependence 
(Fig. 3F, red symbols, left vertical axis), closely 
matching the observed growth dependence of 
the total mRNA concentration (reproduced as 
black symbols in Fig. 3F, right vertical axis). 
The total mRNA fluxes inferred from the deg- 
radation rates, 5[mR] (orange crosses), were 
within 20% of the directly measured synthe- 
sis fluxes, which showcases the consistency 
of these two very different measurement 
approaches. Putting together the results in 
Figs. 2 and 3 shows that the global constraint 
(Eq. 3) is enforced primarily by matching total 
mRNA synthesis flux J,,;R With translational 
capacity (Fig. 3G). 


mRNA synthesis flux and transcriptional 
regulation 


The synthesis flux of each mRNA species is 
given molecularly by the product of the tran- 
scription initiation rate per gene, Om,;, and 
the gene concentration, [G;] (Fig. 1A and 
fig. S1). The growth rate dependence of gene 
concentration is in turn given by the product 
of the number of chromosome replication 
origins (Ori) per cell volume, [Ori], and the 
gene dose relative to the Ori, g; = [Gi]/[Ori]. 
Thus, the total mRNA synthesis flux can be 
expressed as 


Jmr = [Ori] x S| om Bi (7) 


The relative gene dose g; across growth rates 
can be obtained from the chromosomal posi- 
tion of the gene (22, 23) and the chromosome 
replication time (fig. $9, A and B). Further 
including a weak growth rate dependence of 
the Ori concentration (24) (fig. S9, C to E), 
we obtain negative growth rate dependences 
for the concentration of genes [G;] = [Ori] x g; 
at all chromosomal positions (Fig. 4A). It is 
then clear from Eq. 7 that the strong positive 
growth rate dependence seen for the total 
mRNA synthesis flux Jnr (Fig. 3F) cannot be 
accounted for by opposite growth rate depen- 
dences of gene concentrations and must involve 
systematic changes in the promoter activities 
Om,i- This was seen more explicitly by comput- 
ing the distributions of the promoter activity 
Om,i, obtained for each gene using the known 
degradation fluxes 6;[mR,] and gene concen- 
trations [G,] at steady state (eq. S3 in fig. S1; 
see also supplementary text, note S3, and data 
in table S6). The results (Fig. 4B) showed a 
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broad range of promoter activity, spanning 
four orders of magnitude, with the high end 
(~0.3 s ‘in reference condition) approach- 
ing the maximum of ~1 s‘ given the tran- 
scriptional elongation speed of ~50 nt s+ 
and a transcription elongation complex foot- 
print of ~40 nt (25-27). A shift is seen between 
the distributions for reference (gray) and 
carbon-limited (red) growth conditions. In 
fact, for most genes, Om,; in carbon-limited 
conditions decreases to about one-third of 
its value in the reference condition (Fig. 4C), 
a change that is comparable to the reduc- 
tion in growth rate. Thus, the coordination 
of mRNA synthesis flux with the growth rate 
(Fig. 3F) is likely a result of genome-wide 
changes in transcription initiation between 
these conditions. 

To look further into the determinants of 
transcription initiation, we used a canonical 
model of transcription regulation (Fig. 4D) 
(28, 29), where the transcription initiation 
rate Om; for gene 7 is given by the product 
of the concentration of available RNA poly- 
merases ([RNAP],,) and the promoter on rate 
k;. That is 


Omi = ki x [RNAP], (8) 


where k; captures the regulatory activities 
of all transcription factors acting on the 
promoter driving gene 7 (28, 29). Using this 
expression for o,, the balance of mRNA syn- 
thesis and degradation (eq. S3 in fig. S1) can be 
written as 


[RNAP], [Ori] x kigi = 5:[mRi] (9) 


Quantitative relations connect transcriptional 
regulation to gene expression 


From Eq. 9, we can derive two fundamen- 
tal relations connecting transcription reg- 
ulation to gene expression (supplementary 
text, note S4). Summing Eq. 9 over all genes, 
the balance of the total transcription flux 
becomes 

[RNAP],,[Ori] x K=8x [mR] (10) 
where kK = So isi describes the total on rate 
for promoters across the genome and is a 
measure of the total regulatory activity on 
transcription (weighted by gene dose). Given 
the proportionality between active ribosome 
and total mRNA concentrations, 7 = [RO],.../ 
[mR] = 4 (Fig. 2A), Eq. 10 can be written as 


[RNAP], [Ori] x K = (3/7) x [RO] act (11) 
This relation represents a fundamental con- 
straint between the overall transcription activity 
([RNAP].,,), the DNA content (via [Ori]), and 


the translational activity of the cell. 
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Fig. 3. mRNA degradation and 
synthesis. (A to C) Degradation 
of mRNA transcribed from 

the long nuo operon (A) in the 
eference condition (B) and 
carbon-limited condition (C). The 
abundance of mRNA was 
measured by RNA-seq over the 
course of 11 min after the 
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blockage of transcription initiation 
by rifampicin (materials and 
methods and fig. S7). Although 
the abundance of the mRNA 

of genes proximal to the promoter 
(nuoA, orange) drops immediately 
after rifampicin treatment (at 
time t = 0), a lag is observed 
for genes progressively more 
distant from the promoter (from 
orange to blue). The lag time 
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A degradation fluxes, [mR], computed from the mRNA concentration and degradation rates. These quantities should equate the mRNA synthesis fluxes, 


am,[Gi], in steady-state conditions. Dashed lines indicate the median fluxes, 0.194 um~? min’ in the reference condition and 0.108 um~? min” at slow growth. 
(F) (Left axis; red symbols) Total mRNA synthesis flux Jmr = doom lGi] (transcripts synthesized per cell volume per unit time) for a variety of growth 
conditions, as indicated (see table S2 for growth conditions). The slope of radiolabel incorporated into mRNA over time was used to obtain the mRNA synthesis 
flux, whereas the error bars represent the standard deviation from six measurements at different time points after the label addition (fig. S10). The 

orange crosses indicate the total mRNA synthesis flux obtained from summing 6[mR;] using the data in (E). (Right axis; black symbols) Absolute mRNA 
abundances (same data as Fig. 2A). (G) (Left axis; red symbols) Total RNA synthesis flux versus growth rate [same data as in (F)]. (Right axis; gray symbols) 
Concentration of active ribosomes (same data as Fig. 2A). 


Another important relation can be obtained 
by taking the ratio of Eqs. 9 and 10. Noting 
that the mRNA degradation rates are closely 
distributed around the average 6 and are in- 
dependent of growth conditions—i.e., 5; = 5 
(fig. S7, F and G)—we obtain kjg;/K = [mR;|/ 
[mR] = Wm 7. This relation extends further to 
the fractional protein abundances y, ; = 
[P;|/[P] because of the established relation 
between protein and mRNA fractions (Eq. 1 and 
Fig. 1), which leads to 


hi8i x ae ‘ 
K Vini ~ Vp 


(12) 
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This expression relates the (gene dose- 
weighted) regulatory activity on specific pro- 
moters (k;g;) to the mRNA and protein levels 
as determined by transcriptomics (Wm) and 
proteomics (w,,;). Notably, y, ; = [Pi]/[P] gives 
approximately the cellular protein concentra- 
tion [P;] because the total protein concentra- 
tion varies only mildly with the growth rate, 
on average [P] = 4.5 x 10° um“ (fig. S2H). 
Thus, Eq. 12 quantitatively connects regulatory 
activities at the promoter level (#;) to cellular 
protein concentrations [P;] without explicit 
reference to the macroscopic machineries of 
gene expression. Equations 11 and 12 are the 
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central quantitative results of this study. We 
suggest that Eq. 12 be viewed as a quantitative 
statement of the central dogma of bacterial 
gene expression, with Eq. 11 describing a 
system-level constraint on transcription and 
translation. In the following sections, we sep- 
arately explore some consequences of these 
two central relations. 


Global coupling in gene expression 


According to Eq. 12, the mRNA and protein 
fractions corresponding to a given gene 7 are 
dependent not only on the regulatory ac- 
tivity on that gene, k;g;, but also on the total 
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the cell, as illustrated in Fig. 4E. This depen- 
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gene expression across two different conditions 


FC([Pi]) = FC(wp;) = FC(higi)/FC(K) (13) 


In different growth conditions, where the 
promoter on rates ; of many genes are affected, 
we generally expect the total rate K to vary— 
ie., FC(X)#1. Consequently, changes in the 
regulatory activity of a gene are generally ex- 
pected to be different from the changes in the 
fractional abundances of the corresponding 
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Gj] at various distances x from the origin of replication Ori (solid li 
computed as the product of the Ori concentration [Ori] [orange ci 
in fig. S9C with raw data and standard errors from (24)] and the 
g = [Gi|/[Ori] (fig. SOB); see fig. S9 for details. (B) Distribution of transcription 
initiation rates om, in reference condition (black) and slow growth 
computed using the available mRNA abundances and degradation rates (see 

y text, note S3, for details). Dashed lines indicate the median 
initiation rates in the two conditions (2.64 min” for reference condition, 0.87 min+ 
for slow growth). (C) FC of the transcription initiation rates FC(am,) between 
eference condition and slow growth. The data show a generalized decrease 

of initiation rates, with a median reduction of 0.29 (dashed line) at slow growth 
“l) compared with the reference condition (4 = 0.91 hour”). 

of transcriptional regulation (28, 29), with 
the transcription initiation rate for gene i, Om,, depending on the promoter on 
is modulated by transcription factors (TF, TFo, ...), aS well as on 
le RNAPs ([RNAP],y), as described by Eq. 8. 
lustrating the dependence of mRNA and protein abundances on 
by Eq. 12. Consider two genes with promoter 
identical gene concentrations [G;] = [G2 = [G]: 
mRNA and protein fractions (Wm1 = Wp1= Wi and Wn2 = 
ively) depend on both promoter on rates through the total 
regulatory activity K = (k, + k2)[G] (in red). Three possible scenarios are 
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rcles; shown 
gene dose 
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not being down-regulated at the transcriptional level. (Bottom) If 


only k; decreases while kz remains constant, then the proteins and mRNAs 


for the blu 
(Middle) If 


case), the 
at the reg! 


e gene increase despite the lack of change at its promoter level. 

K is unchanged (because of compensating changes in k; and kz in this 
n the changes in protein and mRNA fractions would reflect changes 
ulatory level. (F) E. coli strains harboring constitutive expression 


of lacZ at 


density x m 


abundance 


mRNA and protein. In fact, the latter might 
change even if the corresponding regulatory 
activity k,g; is unchanged because of the 
overall change in regulatory activity K (illus- 
trated in Fig. 4E). 

To determine how the total regulatory ac- 
tivity K may change across growth conditions, 
we returned to the spectrum of carbon-limited 
growth conditions. The growth rate depen- 
dence of K can be deduced by applying the 
relation (Eq. 13) to constitutively expressed 
(i.e., unregulated) genes, for which x; is con- 
stant. For this purpose, we inserted constitu- 
tively expressed lacZ genes at various locations 
x on the chromosome, with known gene doses 
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various locations near oriC (orange) and near terC (blue; loci listed in 
the legend) were grown in carbon-limited conditions (see tables S1 and S2 for 
strains and conditions). LacZ protein abundance per culture volume (optical 
illiliter), obtained from the slopes of B-gal activity versus OD¢o0 
(optical density of a sample measured at a wavelength of 600 nm) (Mi 
is shown; error bars indicate standard errors from four measurements 
and methods). a.u., arbitrary units. (G@) The relative change in the total 
regulatory activity K across growth rates was estimated from the relative change 
in LacZ abundance using the data in (F) and Eq. 14. To do so, the LacZ 
per culture volume was converted to protein fraction by dividing by 
total protein mass per culture volume (fig. S2F). The result shows a linear 
dependence of the total regulatory activity on the growth rate (red line). The 
absolute scale K was set for the reference condition using Eq. 10 with the 
values for the total mRNA synthesis flux Jina obtained from Fig. 3F, the oriC 
concentration from Fig. 4A, and the available RNAP concentration estimated as 
described in the supplementary text, note S5. 


ller units), 
(materials 


g(a) (Fig. 4A). The total rate K can then be 
obtained by measuring the concentration of 
LacZ, [LacZ(x)], for different growth rates as 


8(2) 


[LacZ(a)] ce 


The data in carbon-limited conditions (Fig. 
4F) show that [LacZ(x)] generally decreased at 
faster growth, with a steeper trend when lacZ 
was inserted near ter (blue) compared with 
when it was inserted near o77 (orange). Upon 
calculating K using Eq. 14, the data collapsed 
on the same positive growth rate dependence 
(Fig. 4G). Although this set of experiments 
established the relative changes in K across 
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conditions, the absolute scale of K can be 
determined from Eq. 10 using the measured 
[Ori] (Fig. 4A) and the measured Jing (Fig. 3F) 
for 5[mR] (Eq. 5). As discussed in the supple- 
mentary text, note S5, the abundance of avail- 
able RNAP, [RNAP],,, can be estimated in the 
reference condition to be ~1300 um ®, leading 
toX ~ 127 um? min‘ in the reference condition. 


Identifying promoter on rates 


Knowledge of the magnitude of K, together 
with the mRNA abundances and degradation 
rates, allowed us to compute the promoter 
on rates k; for each gene 7 across growth 
conditions (supplementary text, note S4). The 
results for ~2500 genes (Fig. 5A) displayed 
a broad distribution across more than three 
orders of magnitude. Because k; values were 
computed by combining several different 
genome-wide measurements (supplementary 
text, note 4), we sought to validate them by 
examining the on rates of genes belonging to 
the same operon to see whether the variation 
in the k; of these cotranscribed genes are in 
fact small, as intuitively expected. The co- 
efficients of variation in k; within operons 
were not only much smaller than the values 
obtained by randomly extracting k; from the 
different operons but were also significantly 
smaller than the variation in the concentra- 
tions of proteins associated to the same regu- 
latory units (Fig. 5B and fig. SIOA). 

The complete set of gene expression rates 
generated in this work, including the pro- 
moter on rates, mRNA degradation rates, 
and translation initiation rates (table S6), al- 
lowed us to investigate, at the genome scale, 
the dominance of different factors control- 
ling protein concentrations in FE. coli, as well 
as their changes across conditions. By plot- 
ting the promoter on rates #;; against the pro- 
tein concentrations in the reference condition 
(Fig. 5C, top), we see that the large range of 
protein concentrations can be largely attri- 
buted to differences in the promoter on rates, 
as opposed to other factors such as mRNA 
degradation rate, translation initiation rate, or 
the gene dose (other panels in Fig. 5C), which 
is in agreement with the simple scenario ex- 
pressed by Eq. 12. 

Proteins present at low concentrations tend 
to have lower translation initiation rates ay ; 
and larger mRNA degradation rates 6; com- 
pared with those at high concentrations (mid- 
dle panels in Fig. 5C and fig. S10B); both effects 
tend to reduce the average number of pro- 
teins produced during the mRNA’s lifetime 
(3) (fig. S10, C to F). When focusing on co- 
transcribed genes, we were also able to iden- 
tify posttranscriptional effects (fig. S11, A and 
B) and evidence of premature transcriptional 
termination (fig. S11C). Still, these effects are 
rare and of small magnitude compared with 
the vast range of promoter on rates. 
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Unraveling the innate and regulatory effects on 
gene expression 

FCs in protein abundance across conditions 
showed an almost perfect correlation with 
promoter on rates (Fig. 5D, top), whereas the 
effect of posttranscriptional regulation or gene 
copy number was negligible (other panels in 
Fig. 5D and fig. S10G). Thus, protein concen- 
trations appear to be adjusted across condi- 
tions by modulating the promoter on rate, as 
described by Eq. 13. Yet, the typical changes in 
promoter on rates were very small (Fig. 5D), 
even though the full range of k; spans more 
than three orders of magnitude (Fig. 5C and 
fig. S10, H and I). Based on proteomics data 
collected for a wide range of growth condi- 
tions from Mori et al. (10) (fig. S12, A to E), 
we find that that protein abundances for 
two-thirds of the genes vary by less than five- 
fold (fig. S12F) even though the absolute abun- 
dances vary by more than three orders of 
magnitude. Particularly small changes are 
observed for proteins engaged in housekeep- 
ing activities (fig. S12, G to I, and fig. S11, D to 
F) or those encoded by essential genes (fig. 
$12, F and J). Taken together, we conclude 
that innate promoter sequences determine 
the typical concentrations of most proteins, 
with transcriptional regulation providing 
fine adjustments depending on conditions. 
Some exceptions involve proteins needed in 
specific growth conditions (e.g., TCA pro- 
teins in aerobic conditions), whose genes 
are under strict transcriptional control (fig. 
S12, K to O). 

Overall, promoter on rates for genes in- 
volved in protein synthesis increased with 
growth rate and largely accounted for the 
increase in the total promoter on rate K (Fig. 
5E). Changes in k; values for ribosomal pro- 
teins and elongation factors were similar, as 
expected, because they are largely cotran- 
scribed (Fig. 5F, triangles, and fig. S13, A to D). 
Notably, they also matched the changes for 
the ribosomal RNA (rRNA) promoters (Fig. 
5F, crosses). Thus, despite the known post- 
transcriptional regulation acting on ribosomal 
proteins (30), the coordination between ribo- 
somal proteins and rRNAs is largely imple- 
mented by matching their promoter on rates. 
By contrast, the #; values of other translation- 
affiliated proteins (e.g., initiation factors and 
tRNA synthetases) in different regulatory 
units present a variety of dependencies on 
the growth rate (fig. S13, E to H). Analogously, 
we found a variety of behaviors for the #; of 
genes known to be controlled by cyclic adeno- 
sine 3',5'smonophosphate (cAMP)-cAMP recep- 
tor protein (CRP) and typically expressed in 
carbon-poor conditions (15) (fig. $14). The 
strong increases in the protein levels of many 
catabolic proteins in carbon limitation are the 
result of a combination of faster promoter on 
rates and reduced total regulatory activity 
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at slow growth, which highlights the passive 
effects described in Fig. 4E. 


Control of global mRNA synthesis by the 
anti-sigma factor Rsd 


The constraint between transcription and 
translation (Eq. 11) must be somehow satisfied 
despite the observed changes in active ribo- 
somes (Fig. 2A), DNA concentration (Fig. 4A), 
and total regulatory activity (Fig. 4G). The 
combination [Ori] x X has only a moderate 
dependence on growth rate (Fig. 6A). Instead, 
the concentration of available RNAP, esti- 
mated as the ratio of the mRNA synthesis 
flux (nr) and [Ori] x K based on Eqs. 6 and 
10, exhibited a stronger growth rate depen- 
dence (Fig. 6B, left axis), approximately match- 
ing that of the concentration of active ribosomes 
(Fig. 6B, right axis). 

A simple mechanism to change the availa- 
bility of RNAPs is to change the abundance of 
the transcription machinery itself. However, 
our quantitative proteomics data showed that 
the cellular concentrations of RNAP compo- 
nents, including the housekeeping factor o”° 
(encoded by the gene rpoD), are all constant 
across the growth rate range studied (Fig. 6C, 
squares). We checked expression levels of the 
two known modulators of o” function, 6S RNA 
(31, 32) and the anti-o” protein Rsd (33-35). 
Although the concentration of 6S RNA is 
~1/100 of that of o” (Fig. 6C, gray triangles) 
and thus is unlikely to affect the global tran- 
scription flux in these conditions, the Rsd con- 
centration rose to that of o” as growth rate is 
reduced (Fig. 6C, red triangles). Thus, Rsd 
could be a regulator of global transcription 
by sequestering o”° during exponential growth 
(Fig. 6D), even though it is thought to have its 
primary role in the stationary phase (33, 36). 
We tested this scenario by characterizing the 
total mRNA synthesis flux in a A7sd strain. 
mRNA synthesis flux became nearly indepen- 
dent of growth rate (Fig. 6E, filled red circles), 
exceeding that of the wild-type strain (open 
red circles), especially at slow growth when 
Rsd is highly expressed. Without rsd, the 
synthesis flux was no longer matched to the 
translational capacity (filled symbols) in con- 
trast to the tight matching observed in the wild- 
type strain (open symbols). Concomitantly, the 
Arsd strain exhibited a growth defect that was 
proportional to the amount of Rsd expression 
in wild-type cells in slow growth conditions 
where Rsd is expressed (Fig. 6F). Given the 
approximate constancy of mRNA turnover 
across growth conditions for wild-type cells 
(fig. S7, E to G), we propose that Rsd may have 
a central role in controlling total mRNA con- 
centrations (Fig. 2A and Fig. 3F). 


Discussion 


We used comprehensive transcriptomic and 
proteomic studies, complemented by quantitative 
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Fig. 5. Gene expression is primarily determined by 
the promoter on rates. (A) Distribution of promoter on 
ates k; in the reference and slow growth conditions, 
obtained from the distribution of the translation 
initiation rate and the concentrations of available RNAP, 
kj = Omi /[RNAP],, (Eq. 8), as described in the supple- 
mentary text, note S4. The median promoter on rate 
(vertical dashed lines) shifts from 1.63 x 10-4 wm? min 
in the reference condition (4 ~ 0.9 hour™') to 1.07 x 
04 um? min in slow growth (a ~ 0.3 hour”). This 
change is much less than the approximately threefold 
change in both the growth rate and the median 
transcription initiation rates (Fig. 4, B and C). (B) For 
71 operons containing at least three genes as annotated 
in Ecocyc (53), we computed the coefficient of variation 
(CV) in the promoter on rates k; or in the protein 
concentrations [P;] for genes within each operon in the 
reference condition. The average intraoperon CVs for 
the promoter on rates are significantly smaller than that 
computed for the protein concentrations [P;] (P < 

7 x 10°’, unpaired t test) (fig. SIA). As a control, we 
randomly shuffled the genes across the operons 50 
times, leading to sets of 3550 CVs (gray-filled boxes), 
and considered the CVs computed using all available 
genes (lines on the right). The CVs for the promoter on 
rates are also significantly smaller than those for all 
the other distributions (P < 3 x 10° when comparing 
with the randomized cases) and the genome-wide CVs. 
Boxes and whiskers indicate 50 and 90% intervals, 
respectively; median CVs are indicated by the central 
lines within the boxes. (C) Promoter on rates ki, 
translation initiation rates ap j;, MRNA degradation rates 
8;, and gene concentrations [Gj] are the four molecular 
parameters determining cellular concentration of a 
protein in a given growth condition (Fig. 1A; with the 
transcription initiation rate an, given by k; through 

Eq. 8). These four molecular parameters are plotted 
against the protein concentrations [P;] in the reference 
condition, binned according to the observed protein 
concentrations. Boxes and whiskers indicate 50 and 
90% central intervals for the binned data; the solid lines 
represent moving averages. (D) Same as (C), but for 
the FCs of each quantity across growth conditions 
(slow growth compared with reference). All molecular 
parameters and concentrations shown in (A) to (D) are 
listed in table S6. (E) The sum of promoter on rates 
weighted by gene dose, K = kei (red line; same as 
in Fig. 4G) is partitioned between the contribution from 
ribosomal proteins and translation elongation (Rb + el.) 
factors (green) and the rest of genes (gray area). 
Symbols indicate the partitioning obtained from the 
computed k; across growth rates. The growth rate 
dependence of K largely stems from that of the 
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methods and fig. S14). 


measurement of total mRNA abundance and | S3toS5). The results revealed two simple rules 
transcription flux, to determine the absolute | on promoter and mRNA characteristics, which 
mRNA and protein abundances, the mRNA | profoundly shape how E. coli responds to 
degradation rates, and the promoter on rates | environmental changes while coping with 
for >1500 genes in EF. coli for many growth | global constraints: (i) Promoter on rates span 
conditions during steady-state growth (tables | more than three orders of magnitudes across 
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genes but vary much less (at most approxi- 
mately fivefold) across conditions for most 
genes. Thus, each gene is expressed within an 
innate abundance range across conditions— 
e.g., with ribosomal genes belonging to the 
most abundant and DNA replication proteins 
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mRNA synthesis. (A) Value of x [Ori] across growth rates, obtained from the 
values (data and standard errors) of the total regulatory activity shown in 
Fig. 4G, multiplied by the interpolated values for [Ori] at the same growth rates 
(Fig. 4A). For comparison, the dashed line shows direct proportionality to the 
growth rate. (B) Concentration of available RNAPs (red symbols, left axis), 
estimated from the ratio between the measured mRNA synthesis flux (data and 
errors in Fig. 3G) and K x [Ori] (using the interpolated curves in Fig. 4, A and G). 
This quantity shows a stronger dependence on the growth rate compared 

with x [Ori] in (A) and has the same growth rate dependence as the concentration 
of active ribosomes (gray symbols, right axis). (©) The concentrations of 
various components of the transcription machinery in carbon-limited conditions 
are plotted against the growth rate. Components of the core enzyme, RpoABC, 
and the major sigma factor o’° are shown as squares. Known modulators of 07°, 


is determined from RNA-seq and the concentration of total mRNA concentration 
(fig. S4). (D) Cartoon illustrating the control of RNAP availability through the 
known o/°-sequestration function of Rsd (33, 54). (E) Comparison of mRNA 
synthesis fluxes between wild-type (WT) (open symbols) and Arsd strain (filled 
symbols). (Left axis) Total mRNA synthesis flux of Arsd strain (red filled circles) 
and wild-type (red open circles); standard errors are computed as in Fig. 3F. 
(Right axis) Concentration of active ribosomes computed from the measured 
total RNA for the two strains and the fraction of active ribosomes observed in 
carbon-limited growth (7). (F) The growth defect of the Arsd strain, defined 

as the percent reduction in growth rate compared with that of wild-type cells in 
the same growth condition (black circles, left axis), is plotted against the growth 
rate of wild-type cells for the range of carbon-limited growth conditions. The 
observed growth reduction matches Rsd expression of wild-type cells in the 


Rsd and 6S RNA, are shown as triangles. The protein concentrations are 


belonging to one of the least abundant classes. 
(ii) mRNA characteristics, including transla- 
tion initiation rate and mRNA degradation 
rate, vary little (less than twofold for half of 
the genes) across genes and conditions. The 
translation initiation rates are sufficiently 
rapid to maintain a high density of ribosomes 
on the mRNA (five ribosomes per kilobase; 
Fig. 2A and fig. S6E), resulting in high protein 
production despite short mRNA half-lives. The 
rules governing promoter and mRNA charac- 
teristics deduced here dictate, to a large extent, 
E. col’s strategy to implement gene regulation 
while complying with the constraints on total 
protein concentration and a limited translation 
capacity. This can be cast into two design prin- 
ciples of gene expression, as summarized below. 


Global coordination between transcription 
and translation 


Concentrations of translating ribosomes are 
known to increase linearly with the growth 
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rate of the bacteria (Fig. 2A). Because of the 
constant density of translating ribosomes on 
most mRNA, the total mRNA concentration must 
also scale similarly. As the total mRNA pool is 
specified by the total mRNA synthesis rate 
(given the constant mRNA turnover rate across 
conditions), total mRNA synthesis is balanced 
with translating ribosomes concentration—a 
crucial condition captured by Eq. 11. We refer to 
this balance as the principle of transcription- 
translation coordination. We showed that FE. coli 
implements this coordination across nutrient 
conditions primarily by adjusting the availa- 
ble RNAP concentration through the anti- 
o”° factor Rsd (Fig. 6, D to F). 

If this coordination is broken, as in the case 
of the Arsd mutant (Fig. 6E), then rule 2 can- 
not hold as long as the constraints on trans- 
lation capacity and protein density hold. An 
oversupply of mRNA with respect to ribosomes 
is expected to decrease the rate of translation 
initiation (because of competition for limited 
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same conditions [red triangles, right axis; same data as in (C)]. 


ribosomes) and increase the rate of mRNA 
degradation [because of reduced protection 
of mRNA by elongation ribosomes against 
ribonuclease (RNase) activity] (37, 38). Aside 
from the futile cycle involving the synthesis 
and degradation of mRNAs and affecting 
growth (Fig. 6F), breaking rule 2 would com- 
plicate the otherwise simple relation between 
transcriptional regulation and protein concen- 
trations of the wild-type system. 


The predominant role of transcriptional control 
in setting protein concentrations 


The similarity of mRNA characteristics (rule 2) 
together with the vast disparity of promoter 
characteristics (rule 1) across genes in a given 
condition implies that protein abundances are 
predominantly set by the promoter characteris- 
tics, specifically, the promoter on rates. Further- 
more, because mRNA characteristics remain 
similar across different growth conditions 
(rule 2), changes in protein concentrations 
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across conditions must arise primarily from 
changes in the promoter on rates—i.e., through 
transcriptional regulation (Fig. 5D). We refer 
to this strong effect of transcription on gene 
expression as the principle of transcriptional 
predominance. 

The strong mRNA-protein correlations are 
at odds with early studies on bacterial gene 
expression (39-45). These differences may orig- 
inate in part from technological advances in 
transcriptomics and proteomics over the years 
(0, 45, 46). Further, given the very different 
turnover rates of proteins and mRNAs, accu- 
rate mRNA-protein comparison requires en- 
suring steady-state conditions, which was a 
crucial component of our experimental design 
(materials and methods). We note that strong 
mRNA-protein correlation, similar to what we 
report here, was also contained in a number 
of recent datasets (13, 16-18) (fig. S3, T to W). 
However, such comparisons were not articu- 
lated as the main messages in these studies 
because their focus was on the variability in 
translational characteristics among mRNAs. 
Although we also see such differences (fig. S4, 
C to E), our data [as well as those from (/4-17)] 
show that such variabilities do not represent 
the general behavior of most genes. 

Although setting protein concentrations 
transcriptionally appears simple, a quantita- 
tive relation between promoter on rates and 
protein concentrations is complicated by the 
fact that the total protein concentration does 
not vary by much, despite strong changes in 
total transcriptional activity. Because protein 
concentrations do not depend on the absolute 
flux of the corresponding mRNAs but rather 
on the fraction of the total transcription flux, a 
hypothetical doubling of all promoter on rates, 
as we alluded to above, will have no effect on 
mRNA and protein concentrations. Further- 
more, the relations between promoter on rates 
and protein concentrations are insulated from 
growth-dependent differences in shared ma- 
chinery, such as RNAPs and ribosomes. 


Global transcriptional coupling and its 
consequences 


Because the protein output of a given pro- 
moter depends on the total regulatory activity 
K in Eq. 12, nonintuitive relations between 
promoters and protein concentration can arise 
when K changes across conditions. The latter is 
likely whenever there is a substantial change in 
growth conditions—e.g., because of changes in 
the on rates of ribosomal genes (fig. S12H and 
Fig. 5H). Hence, it is generally incorrect to infer 
regulatory activities directly from changes in 
mRNA and protein concentrations (Fig. 4E). 
This effect of global coupling that reflects the 
passive regulatory effect would hardly affect 
the result of most classical studies, which typ- 
ically have involved large changes in the out- 
put of a few individual promoters. However, 
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for most genes whose expression changed 
moderately across conditions (e.g., less than 
twofold for 60% of the genes under carbon- 
limited growth; see Fig. 5D), the effect of global 
coupling would be substantial. 

Our work provides a quantitative framework 
to distinguish gene-specific regulatory effects 
from global interactions in gene expression 
studies. Knowing the promoter on rates #; for 
individual genes facilitates a direct, promoter- 
centric view of regulation across conditions at 
the genome scale (Fig. 5). This improves upon 
previous estimates of promoter activity based 
on protein synthesis fluxes (47), which mix 
systemic effects, such as RNAP availability, 
with gene-specific regulatory effects (supple- 
mentary text, note 4). These results would 
thus be of use in deciphering the behaviors 
of endogenous genetic circuits and in guiding 
the design of synthetic circuits in variable 
growth conditions (48-50). 

The results described here are specific to 
bacteria. Eukaryotic gene expression involves 
complex posttranscriptional regulation, includ- 
ing protein secretion (57) and degradation 
through ubiquination and autophagy (52). 
Global constraints are less understood, in par- 
ticular the extent to which protein density may 
vary across conditions. Even quantifying the 
cell volume may be difficult because large por- 
tions within a cell may be occupied by sub- 
cellular compartments (e.g., vacuoles) that do 
not contribute to the cytosol. Nonetheless, 
our study provides a framework to quantita- 
tively explore gene expression in such complex 
systems. 


Materials and methods summary 


Experimental methods for cellular growth; 
RNA-seq; quantification of mRNA abundance, 
synthesis fluxes, and degradation rates; and 
numerical and statistical methods are reported 
in the supplementary materials. 
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Human-level play in the game of Diplomacy by 
combining language models with strategic reasoning 
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Despite much progress in training artificial intelligence (Al) systems to imitate human language, 
building agents that use language to communicate intentionally with humans in interactive environments 
remains a major challenge. We introduce Cicero, the first Al agent to achieve human-level performance 
in Diplomacy, a strategy game involving both cooperation and competition that emphasizes natural 
language negotiation and tactical coordination between seven players. Cicero integrates a language 
model with planning and reinforcement learning algorithms by inferring players’ beliefs and intentions 
from its conversations and generating dialogue in pursuit of its plans. Across 40 games of an anonymous 
online Diplomacy league, Cicero achieved more than double the average score of the human players 
and ranked in the top 10% of participants who played more than one game. 


major long-term goal for the field of 
artificial intelligence (AI) is to build 
agents that can plan, coordinate, and 
negotiate with humans in natural lan- 
guage. Although much progress has 
been made in language models that imitate 
human language (1), effective negotiation 
agents must go beyond this by understand- 
ing the beliefs, goals, and intentions of their 
partner; planning joint actions that account 
for their partner’s goals; and persuasively and 
intentionally communicating these proposals. 
We present Cicero, an AI agent that achieved 
human-level performance in the strategy game 
Diplomacy. In Diplomacy, seven players con- 
duct private natural language negotiations to 
coordinate their actions to both cooperate and 
compete with each other. By contrast, prior 
major successes for multi-agent AI have been 
in purely adversarial environments such as 
chess (2), Go (3), and poker (4), in which com- 
munication has no value. For these reasons, 
Diplomacy has served as a challenging bench- 
mark for multi-agent learning (5-8). 
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Cicero couples a controllable dialogue mod- 
ule with a strategic reasoning engine. At each 
point in the game, Cicero models how the other 
players are likely to act on the basis of the game 
state and their conversations. It then plans how 
the players can coordinate to their mutual ben- 
efit and maps these plans into natural language 
messages. 

We entered Cicero anonymously in 40 games 
of Diplomacy in an online league of human 
players between 19 August and 13 October 2022. 
Over the course of 72 hours of play that in- 
volved sending 5277 messages, Cicero ranked 
in the top 10% of participants who played more 
than one game. 


Challenges of human-Al cooperation 
in Diplomacy 


Almost all prior AI breakthroughs in games have 
been in two-player zero-sum (2p0s) settings, in- 
cluding chess (2), Go (3), heads-up poker (9, 10), 
and StarCraft (11, 12). In finite 2p0s games, 
certain reinforcement learning (RL) algorithms 
that learn by playing against themselves—a 
process known as self-play—will converge to 
a policy that is unbeatable in expectation in 
balanced games (73). In other words, any finite 
2pOs game can be solved through self-play with 
sufficient compute and model capacity. 
However, in games that involve cooperation, 
self-play without human data is no longer 
guaranteed to find a policy that performs well 
with humans, even with infinite compute and 
model capacity, because the self-play agent may 
converge to a policy that is incompatible with 
human norms and expectations. This effect 
can be clearly seen in settings that involve 


language, in which prior work found that 
self-play produced uninterpretable language 
despite achieving high task success for the 
agents (14, 15). Even in dialogue-free versions 
of Diplomacy, we found that a self-play algo- 
rithm that achieved superhuman performance 
in 2p0s versions of the game performed poorly 
in games with multiple human players owing 
to learning a policy inconsistent with the norms 
and expectations of potential human allies 
(16, 17). Thus, a major challenge in Diplomacy 
is to develop a way to harness the potential 
benefits of self-play in a way that leads to 
human-compatible language and behavior. 

The challenge of maintaining human- 
interpretable communication is particularly 
acute in Diplomacy, in which our agent sent 
and received an average of 292 messages per 
game (fig. S8). Messages in the game often 
involve coordinating precise plans, and any 
miscommunication can result in their fail- 
ure. Each message an agent sends must be 
grounded in (be contextually appropriate and 
consistent with) lengthy dialogue histories, 
game states—including proposed hypothetical 
states—and goals. If messages are inaccurately 
grounded, humans may ask the agent to ex- 
plain its errors (a challenging task that may 
lead to further mistakes) or choose to coop- 
erate with others instead. Further, repeated 
messaging creates feedback loops, in which 
the language model imitates the style of its 
own previous messages—for example, send- 
ing a short or incoherent message will increase 
the likelihood of such messages in the future 
(18). Past work on strategic dialogue systems 
has avoided these issues by focusing on sim- 
pler settings (14, 19-21), which involve only 
a single human partner, shorter dialogue his- 
tories, and simpler strategies. 

Last, Diplomacy is a particularly challeng- 
ing domain because success requires building 
trust with others in an environment that en- 
courages players to not trust anyone. Each 
turn’s actions occur simultaneously after non- 
binding, private negotiations. To succeed, an 
agent must account for the risk that players 
may not stay true to their word, or that other 
players may themselves doubt the honesty of 
the agent. For this reason, an ability to reason 
about the beliefs, goals, and intentions of others 
and an ability to persuade and build relation- 
ships through dialogue are powerful skills in 
Diplomacy. 


The game of Diplomacy 


Diplomacy is a board game in which seven 
players compete to control supply centers (SCs) 
on a map, by moving their units into them. A 
player wins by controlling a majority of SCs. 
The game may also end when all remaining 
players agree to a draw, or a turn limit is 
reached, in which case scores are determined 
by the number of SCs each player controls. 
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Board state & history 


Planning 


Output action 


Joint action 


Policies 


=a -~ 
fo) 
(all players) 
State value ( 


o Simulator 


Future state 


Dialogue-free value model (from RL) 


Dialogue 


AUSTRIA: Hi Italy! Care to work 
together on this one? If you 

support me into BOH | think we'd 
both be able to grow quickly. 


ITALY: Could you support me 
into BUL in return? 


Dialogue history 


Dialogue 
model 


>) Dialogue-conditional 
action model 


Message candidates 
= 


Message generation 


Intents 


AUSTRIA: VIE > BOH, ... 
ITALY: TYR S VIE BOH, ... 


Anchor policies 
(all players) 


AUSTRIA:Hi Italy! Care to work 
together on this one? If you 

support me into BOH | think we'd 
both be able to grow quickly. 


ITALY: Could you support me 
into BUL in return? 


Filters 
(nonsense, 
grounding, value) 


AUSTRIA: Sure thing! | have 
ordered SER to support GRE 
to BUL. 


Output message 


Fig. 1. Architecture of Cicero. Cicero predicts likely human actions for each player according to the board 
state and dialogue, using that as the starting point for a planning algorithm using RL-trained models. 

The output of planning is an action for the agent as well as beliefs about other players’ actions, which are 
used to select intents for a dialogue model to condition on. Generated message candidates undergo several 


filtering steps before a final message is sent. 


Each turn, all players engage in private pair- 
wise free-form dialogue with the others during 
a negotiation period, and then all players simul- 
taneously choose an action comprising one 
order per unit they control. A unit may sup- 
port other units, including those of another 
player, which forms the basis for much of the 
negotiation in Diplomacy. A detailed descrip- 
tion of the rules is provided in the supplemen- 
tary materials (SM), materials and methods, 
section C. 


Overview of Cicero 


At a high level, Cicero combines a dialogue 
module with a strategic reasoning module, 
along with a filtering process to reject low- 
quality messages. A diagram of Cicero is pro- 
vided in Fig. 1. 


Dialogue 


Cicero generates dialogue using a pretrained 
language model that was further trained on 
dialogue data from human games of Diplomacy. 
Crucially, in addition to being grounded in 
both the dialogue history and game state, the 
dialogue model was trained to be controllable 
through intents, which we here define to be a 
set of planned actions for the agent and its 
speaking partner. This was accomplished by 
automatically augmenting the human data 
with inferred intents and using this informa- 
tion as further conditioning during training. 
For example, intents showing the agent moving 
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into the territory Bulgaria (““BUL”) with sup- 
port from its speaking partner might yield a 
message such as “Could you support me into 
BUL in return?” Grounding in intents re- 
lieved the dialogue model of most of the re- 
sponsibility for learning which actions were 
legal and strategically beneficial. In particu- 
lar, this control provided an interface be- 
tween the dialogue generation and strategic 
reasoning. 


Strategic reasoning 


Cicero uses a strategic reasoning module to 
intelligently select intents and actions. This 
module runs a planning algorithm that pre- 
dicts the policies of all other players on the 
basis of the game state and dialogue so far, 
accounting for both the strength of different 
actions and their likelihood in human games, 
and chooses an optimal action for Cicero that 
is based on those predictions. Planning relies 
on a value and policy function trained through 
self-play RL that penalized the agent for de- 
viating too far from human behavior, to main- 
tain a human-compatible policy. During each 
negotiation period, intents are receomputed 
every time Cicero sends or receives a message. 
At the end of each turn, Cicero plays its most 
recently computed intent. 


Message filtering 


Cicero passes each generated message through 
several filters designed to limit messages that 


are nonsensical, inconsistent with intents, or 
strategically poor. 


Methods 
Data 


We obtained a dataset of 125,261 games of 
Diplomacy played online at webDiplomacy.net. 
Of these, 40,408 games contained dialogue, 
with a total of 12,901,662 messages exchanged 
between players. Player accounts were de- 
identified, and automated redaction of per- 
sonally identifiable information (PII) was 
performed by webDiplomacy. We refer to this 
dataset hereafter as WebDiplomacy. 


Intent-controlled dialogue 


Cicero generates messages through a neural 
generative Diplomacy dialogue model that 
was trained to be controllable through a set 
of intents. 


Imitation dialogue model 


We took R2C2 (22) as our base model—a 
2.7 billion-parameter Transformer-based (23) 
encoder-decoder model pretrained on text 
from the internet by using a BART denoising 
objective (24). The base pretrained model 
was then further trained on WebDiplomacy 
(Methods, Data) through standard maxi- 
mum likelihood estimation. Specifically, with 
a dataset D = {|x ,y]}, the model was 
trained to predict a dialogue message y? from 
player A to player B at time ¢, given all of the 
following represented as text x: dialogue 
history (all messages exchanged between 
player A and the six other players up to time 0); 
game state and action history (current game 
state and recent action history); player rating 
(rating for A corresponding to Elo rating 
computed from games in WebDiplomacy); 
game and message metadata (additional info 
about game settings and the current message, 
such as time since the last message, and current 
turn). Additionally, the model conditions on 
intents (a set of proposed actions for players A 
and 86 for the current turn and future turns, 
representing the intent for message y®). Fur- 
ther details on the training data, training pro- 
cedure, relevant hyperparameters, sampling 
procedures, and other inference-time methods 
are provided in the SM, section D.1. 

During play, we used additional modules 
governing when to speak and to whom, which 
are described in the SM, section D.4. 


Controllable dialogue model 
through intents 


Standard language modeling approaches would 
train our dialogue model only to imitate the 
messages from our dataset but not to outper- 
form them. To go beyond imitation learning, 
we made the dialogue model controllable by 
generating messages conditioned on a plan 
specified by the strategic reasoning module 
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A Intent model training 


Board state & history 
ENG:Do you want NTH to support BEL? 
FRA: No, BELis moving to HOL 


ENG: Alright i'll support you in 


Dialogue history ENG-FRA 


C Dialogue model training 


Board state & history 


ENG:Do you want NTH to support BEL? 
FRA: No, BEL is moving to HOL 


ENG: ... 


Only trained on “truthful” 
situations where a zero-shot 
lie detector says the player 
wasn't lying about their orders. 


ENG:NTH S BEL®HOL . ... 


B_ Intent annotation 


Board state & history 


ENG: Do you want NTH to support BEL? 
FRA: No, BEL is moving to HOL, 


ENG: Alright i'll support you in 


Dialogue history ENG-FRA 


D Dialogue model inference 


Board state & history 


ENG: Bounce in the English Channel? 


ENG:Alright, !'ll 
support you in 


FRA: No, | need to move to MAO 
to protect against Italy 


ENG aes 


FRA: BEL > HOL., ... 


Dialogue history ENG-FRA 


Fig. 2. Illustration of the training and inference process for intent-controlled 
dialogue. Actions are specified as strings of orders for units; for example, “NTH 

S BEL - HOL” means that North Sea will support Belgium to Holland. (A) An “intent 
model” was trained to predict actions for a pair of players on the basis of their 
dialogue. Training data was restricted to a subset in which dialogue is deemed 


(intents), resulting in higher-quality messages. 
More specifically, a message is defined to have 
intent z if z is the most likely set of actions that 
the sender and recipient will take—for both 
the current turn and several future turns—if 
no further dialogue occurs after the message is 
received. To establish this control, we devel- 
oped techniques to automatically annotate 
every message in the training set with a set of 
actions corresponding to the message content. 
During training, the dialogue model learned 
the distribution pp [y® |x®,z®], where z 
represents the intent for datapoint [x®, y?)] ; 
as a result, at inference, time z provides a point 
of control over generation (25). We later de- 
scribe the training and inference process, which 
is also illustrated in the pipeline in Fig. 2. The 
effect of the intents on the generated dialogue 
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Dialogue history ENG-FRA 


“truthful” (SM, section D.2.3). 
annotated with the output of 
with an agreement message i 


(D) During play, intents were 


is demonstrated in Fig. 3; conditioning on 
different planned actions results in different 
messages. 

We considered other notions of intent during 
development, such as controlling messages to 
focus on specific subsets of actions, third-party 
actions, or to have a particular tone. Richer 
intents are harder to annotate on human mes- 
sages, are harder to select with the planning 
module, and create greater risk of taking the 
language model out of distribution. 


Annotating training messages with intents 


When annotating messages in the training data 
with corresponding intents, our goal was for the 
proposed actions 2 to closely reflect the con- 
tent of a message y so that at training time, the 
model learned to exploit the information in 2, 


ENG:Do you want NTH 


Annotated to support BEL? 


/ intents 


ENG:NTH S BEL... 
FRA: BEL H, ... 


"Artificially injected 
agreement 


Intent model 


LEGEND 


Dialogue model 


Planned moves 


ENG: Okay, I'll move (intents) 
to North Sea then. 
—_—> 
ENG:LON » NTH, ... Model inputs 
FRA: BRE > MAO, ... 
aati 
Planning Training targets 


(B) Each message in the dialogue training dataset was 
the intent model on the dialogue up to that point, 
njected at the end. (C) The dialogue model was trained 


to predict each dataset message given the annotated intent for the target message. 


supplied by the planning module instead. 


Naively, we could have used the actual ac- 
tions played by the sender and recipient at 
the end of each turn in the span of the intent. 
However, these actions may not reflect the 
content of a message if (i) a message is not 
honest or (ii) subsequent messages change the 
sender’s plans. To resolve (i), we predicted the 
most likely action according to a dialogue- 
conditional action prediction model trained 
on a “truthful” subset of the dataset, in which 
we predicted that a player’s dialogue was not 
deceptive to others (SM, section D.2). This is 
showcased in Fig. 2A; we refer to this model as 
the intent model. To resolve (ii), we restricted 
the dialogue history that this intent model saw 
up to the message in question, which signaled 
to the model to predict actions as though the 
dialogue had ended at that point in time. We 


9 DECEMBER 2022 * VOL 378 ISSUE 6624 1069 


RESEARCH | RESEARCH ARTICLES 


Depicted intents: 


England convoys an army to Belgium with the 
support of France and Germany while taking Norway 
in a manner friendly to Russia. 


Example generations from the dialogue model with these actions as the conditioned intent: 


ENG FRA Mind supporting Edi - Bel? 


ENG* GER Do you wanna support my convoy to 
Bel? With Italy going aggressive France will fall 
quickly and we can make gains off of both Russia 
and France. 


ENG*® RUS How are you thinking Germany is 
gonna open? | may have a shot at Belgium, but I'd 
need your help into Den next year. 


England convoys an army to Norway and also 
aggressively moves to the Barents Sea against 
Russia. 


ENG» FRA What's the plan for Bel? 


ENG GER I will be convoying my army to 
Norway and moving to Barrents Sea. 


ENG RUS Hey, | moved to Edi to keep my 
options open. Do you have a problem with me 
putting an army in Nwy? 


England convoys an army to Norway and supports 
France to gain Belgium, preventing Germany from 
taking Belgium. 


y .. 


| —— C 


ENG» FRA Let me know if you want me to 
support you to Bel, otherwise I'll probably poke Hol. 


ENG*® GER Looks like you'll get three builds 
unless France bounces you! Are you gonna 
bounce Russia or not? 


ENG RUS Hey, | moved to Edi to keep my 
options open. Do you have a problem with me 
putting an army in Nwy? 


Fig. 3. The effect of intents on Cicero’s dialogue. Pictured are three different possible intents in the same game situation. In each case, we show a message 
generated by Cicero (England; pink) to France (blue), Germany (orange) and Russia (purple) conditioned on these intents. Each intent leads to quite different 
messages, which are consistent with the intended actions. 


additionally added messages to the dialogue 
history that suggested a conclusive agreement 
between the two parties (Fig. 2B). As a result, 
we obtained a high degree of correspondence 
between the action annotated as the intent of 
a message and the content, achieving a score 
of 97% on a small test set designed to measure 
this correspondence (compared with 77% for 
a simpler baseline) (table $2). Then, the dia- 
logue model could be trained in the manner 
described in the above section Imitation dia- 
logue model and in Fig. 2C (SM, section D.2). 


Selecting intents during play 


During play, Cicero used the strategic rea- 
soning module to select intent actions for the 
current turn (Fig. 2D), whereas intent actions 
for future turns were generated by means of a 
human-imitation model. 


Agent intent action for current turn 


Cicero conditioned its dialogue on the action 
that it intends to play for the current turn. 
This choice maximizes Cicero’s honesty and its 
ability to coordinate but risks leaking infor- 
mation that the recipient could use to exploit 
it (for example, telling them which of their 
territories Cicero plans to attack) and some- 
times led to out-of-distribution intents when 
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DIALOGUE QUALITY RATINGS (%) 


Consistent 
with state 
Language model 61.90 
+game state grounding 84.13 
+ intent grounding (CICERO) 87.30 


Consistent High 


with plan quality Perplexity 
76.19 20.64 8.02 
83.33 29.37 7.94 
92.86 37.30 7.70 


Fig. 4. Controllable dialogue modeling results. We report dialogue quality ratings and perplexity on the 
validation set for the Cicero dialogue model and compare them with a baseline without intent grounding and a 
baseline without either intent or game-state grounding (“Language model”). Dialogue quality ratings were 
calculated according to expert annotation of generated messages in 126 situations; we report the percent of 
messages (before filtering) labeled as consistent with the game state, as consistent with the plan for the 
next actions, and as particularly high quality. Lower perplexity corresponds to more probability mass on the 


ground-truth human messages. 


the intended action was hostile, because in 
adversarial situations, humans may rarely com- 
municate their intent honestly. We describe 
approaches for mitigating these risks in the 
section Message filtering. 


Recipient intent action for current turn 


Cicero considered the subset of recipient ac- 
tions with high likelihood under its beliefs 
about their policy. High likelihood requires 
that either an action is deemed beneficial for 


the recipient and/or that they are believed to 
be likely to play it given the dialogue. Among 
this restricted set, Cicero selected the recip- 
ient action with the highest expected value 
for itself (SM, section D.2.4). 


Dialogue modeling results 


We compared the performance of our dialogue 
model with a baseline without intent ground- 
ing and one without intent or game-state 
grounding (a “language model”). We report 
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England agrees: 


ENG FRA Yes! | will move out of ENG if you 
head back to NAO. 


Cicero predicts England will retreat from ENG to NTH 
85% of the time, backs off its own fleet to NAO as 
agreed, and begins to move armies away from the 
coast. 


Fig. 5. The effect of dialogue on Cicero’s strategic planning and intents. 
Cicero (France; blue) and England (pink) are entangled in a fight, but it would be 
beneficial for both players if they could disengage. Cicero has just messaged England 
“Do you want to call this fight off? | can let you focus on Russia and | can focus on 
Italy.” Pictured are three ways that England might reply and how Cicero adapts to 


both perplexity on the validation set and dia- 
logue quality rating scores, which were cal- 
culated on the basis of expert annotation of 
messages generated in 126 Diplomacy game 
situations. Experts were asked to label whether 
a message was (i) consistent with the game 
state, (ii) consistent with the agent’s plan, and 
Gii) notably high quality, compared with that 
of an average human. Results are shown in 
Fig. 4, and more details regarding this eval- 
uation are provided in the SM, section D.2.3. 
Our model outperformed the baselines on 
all metrics. The improvement in validation 
perplexity demonstrated that the model can 
use additional grounding information to 
better predict human messages. Expert anno- 
tations showed that the grounding informa- 
tion provided by the intents and game state 
led to higher-quality messages that were 
highly consistent with the agent’s intended 
action. 


Strategic reasoning 


To generate the intents for dialogue and to 
choose the final actions to play each turn, Cicero 
ran a strategic reasoning module that predicts 
other players’ policies (a probability distribu- 
tion over actions) for the current turn accord- 
ing to the state of the board and the shared 
dialogue and then chose a policy for itself for 
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England is hostile: 


ENG FRA You've been fighting me all game. 
Sorry, | can't trust that you won't stab me. 


Cicero does not back off its fleet but rather attacks EDI 
with it, and leaves its armies at the coast to defend 
against an attack from England, predicting that England 
will attack about 90% of the time. 


the current turn that responded optimally to 
the other players’ predicted policies. 

Doing this with human players requires 
predicting how humans will play. A popular 
approach in cooperative games is to model 
the other players’ policies through supervised 
learning on human data, which is commonly 
referred to as behavioral cloning (BC). How- 
ever, pure BC is brittle, especially because a 
supervised model may learn spurious corre- 
lations between dialogue and actions (fig. S6). 

To address this problem, Cicero used variants 
of piKL (26) to model the policies of players. 
piKL is an iterative algorithm that predicts 
policies by assuming each player 7 seeks to 
both maximize the expected value of their 
policy x; and minimize the Kullback-Leibler 
(KL) divergence between 1; and the BC policy, 
which we call the anchor policy t;. An anchor 
strength parameter A<(0, ~) trades off between 
these competing objectives. 


piKL: KL-regularized planning 

piKL is an iterative algorithm that predicts 
player policies. A complete description of the 
algorithm can be found in the SM, section E.1. 
piKL treats each turn in Diplomacy as its own 
subgame in which each player 7 simultane- 
ously chooses an action q; that results in joint 
action a = (dq, ..., @,), and then each player 7 


England tries to take advantage of Cicero: 


ENG FRA Yes! I'll leave ENG if you move KIE -> 
MUN and HOL -> BEL. 


Strategic planning rejects the possibility of vacating KIE 
and HOL, because it would make Cicero too vulnerable. 
Cicero backs off its fleet to NAO but keeps armies at 
the coast to defend. 


each. (Left and middle) Because Cicero's planning anchors around a dialogue- 
conditional policy model, its predictions for other players and accordingly its own 
plans are flexible and responsive to negotiation with other players. (Right) Yet Cicero 
also avoids blindly trusting what other players propose by rejecting plans that have low 
predicted value and run counter to its own interests. 


receives a reward wa) determined by a value 
function w;. We discuss the training of this 
value function later below. 

piKL assumes player 7 seeks a policy 1; that 
maximizes the modified utility function 


Uj (mi, Wi) = Ui(T, Mi) — ADx(millt%) (1) 


where 1_; represents the policies of all players 
other than 7, and u,(n;, 1_;) is the expected 
value of 2; given that other players play 7_;. 
Specifically, let Q{-1(a;) = u;(a;,n°;') and let 


Qe (aj 
ri(a)=n(ajep( | (a 
On each iteration ¢, piKL updates its predic- 
tion of the players’ joint policies to be 


if (=) nity (7) nt (3) 


piKL provably converges to an equilibrium 
in the modified utility space (26). When the 
anchor strength A is set to a large value, piKL 
predicts that player 7’s policy will be close to 
the anchor policy t;. When A is small, piKL 
predicts that player 7’s policy will have high 
expected value and may deviate substantially 
from 1;. 
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A generalization of piKL referred to as 
Distributional Lambda piKL (DiL-piKL) re- 
places the single \ parameter in piKL with a 
probability distribution over ( values (SM, 
section E.1.3). On each iteration, each player 
samples a A value from their distribution. In 
practice, we found this led to better perfor- 
mance (17). 


Dialogue-conditional planning 


Because dialogue influences the BC policy (the 
anchor policy t,;), pikL provides a mechanism 
for dialogue to influence policy predictions. 
Different possible messages between Cicero 
and another player may produce different an- 
chor policies (Fig. 5), which ultimately gives 
different final predictions about what that 
player will do. 

Other players may of course be deceptive 
about their plans. Cicero does not explicitly 
predict whether a message is deceptive or not 
but rather relies on piKL to directly predict the 
policies of other players on the basis of both 
the BC policy (which conditions on the mes- 
sage) and on whether deviating from the BC 
policy would benefit that player. 

Because dialogue in Diplomacy occurs pri- 
vately between pairs of players, Cicero must 
reason about what information players have 
access to when making predictions. For ex- 
ample, if Cicero is coordinating an attack with 
an ally against an adversary, Cicero’s predic- 
tion of the adversary’s policy must account for 
the adversary not being aware of the intended 
coordination. Cicero accomplished this by pre- 
dicting by means of pairwise piKL what every 
other player’s policy will be. 

Specifically, during strategic planning, for 
each player j, Cicero computed an anchor 
policy for both itself and player j on the basis 
of their shared conversation, the board state, 


Example of coordination - CICERO is AUSTRIA 


and the recent action history. Cicero then ran 
DiL-piKL for the two players to predict player 
js policy. On each iteration, Cicero assumed 
that the remaining five players would play 
according to a policy computed by means of 
RL, conditional on the policies of Cicero and 
player j. This process gave an independent 
prediction of each player’s policy. 

Next, Cicero accounted for the players’ pol- 
icies not being independent owing to their 
ability to correlate their actions through pri- 
vate dialogue that Cicero did not observe. 
Cicero accomplished this by constructing an 
approximate joint policy for all other players 
through self-normalized importance sampling: 
We sampled NV = 1000 joint actions a from the 
independent piKL policies of the other players 
and reweighted them by the likelihood ratio of 
a under the correlated and independent RL 
policies, respectively. 

Last, Cicero chose the action a; that best 
responds to the predicted joint policy 1_; of the 
other players, while still being as consistent 
as possible with its dialogue. Specifically, 
Cicero chose the action argmaxg,w;(@i, mi) + 
Mogt;(a;), where w; is the RL value function, 
t,(a,) is the probability of the action under the 
dialogue-conditional imitation policy, and (4 = 
3 x 10%. Cicero used a smaller A for regulariz- 
ing its best response than for its computation 
of other players’ policies; thus, the dialogue 
more strongly informed Cicero’s expectations 
of how other players would coordinate while 
still allowing Cicero more leeway to deviate 
when the action that it predicted humans 
would most likely choose in its situation was 
suboptimal. 


Self-play RL for improved value estimation 


Applying piKL requires a state value function. 
Self-play provides an avenue for training such 


Example of negotiation - CICERO is FRANCE 


( >) 
ITALY: What are you thinking long term? 
Should | go for Turkey or head west 


AUSTRIA: Yeah, he went to Armenia which is really 
great. You can go either way, but if Turkey is 
committing to Russia you could always lepanto 


AUSTRIA:A lepanto into Turkey is really really strong, 
especially since he committed so hard against Russia 


ITALY: |'m down to go for it. Would 
definitely need your help in 02 though 


AUSTRIA: Of course, happy to do that! 


ITALY: Fantastic! 


X J 


ae \ 


FRANCE: 1'll work with you but | need Tunis for now. 
TURKEY: Nope, you gotta let me have it 


FRANCE: No, | need it. 


FRANCE: You have Serbia and Rome to take. 
TURKEY: they're impossible targets 
FRANCE: Greece - lonian lonian - Tyrr 


TURKEY: hm, you're right 


TURKEY: good ideas 


FRANCE:Then in fall you take Rome and Austria collapses. 
XM wy 


Fig. 6. Successful dialogue examples. Examples of Cicero (left) coordinating and (right) negotiating with 


authors of this paper in test games. 


1072 9 DECEMBER 2022 + VOL 378 ISSUE 6624 


a value function but risks becoming incom- 
patible with human play (6, 17). To address 
this, we used piKL during self-play to keep the 
policies human-compatible. 

One challenge in doing self-play in Diplomacy 
is that players may adapt their actions sub- 
stantially on the basis of dialogue with other 
players, including coordinating joint actions. 
Explicitly simulating conversations would 
be extremely expensive in RL. However, a key 
insight is that a joint, shared BC policy trained 
on the joint action distribution of the human 
data already implicitly captures the effects of 
dialogue on the action distribution of human 
players by modeling that action distribution 
directly. 

We therefore developed Correlated and 
Shared (CoShar) piKL, which allowed for reg- 
ularization toward a joint, correlated anchor 
policy t shared by all players rather than 
toward per-player policies. In this way, we 
relied on the joint anchor policy to capture 
the correlation between all players’ policies. 
Specifically, CoShar piKL differs from standard 
piKL in that the probability of joint action a = 
(dy. Gn) in policy x“* becomes 


| 


n (a) < 1(a)exp b 
We found that CoShar piKL retained much 
of the correlation present in the joint anchor 
policy t while also modeling strong human 
play better than imitation alone. 

Our resulting self-play algorithm operated 
similarly to AlphaZero (27) and ReBeL (28) by 
applying planning “in the loop” as the im- 
provement operator for RL. In our case, plan- 
ning was by use of an approximated version 
of CoShar piKL. We generated self-play tra- 
jectories in which on each turn we computed 
the CoShar piKL policy using a learned state- 
value model. We regressed the joint policy 
model toward that policy and regressed the 
value model toward the expected values of all 
players under that policy. We then sampled a 
joint action from that policy to generate the 
next state in the trajectory. The anchor policy 
was fixed throughout training to anchor the 
RL near human play (SM, section E.4). 


Message filtering 


Prior work has shown that neural language 
models suffer from contradictions and incon- 
sistency as well as a tendency to “hallucinate,” 
or generate factually incorrect information (29). 
In the complex domain of Diplomacy, dialogue 
models exhibit both these problems and other 
more subtle mistakes, such as deviations from 
the intents used to control the message or 
blunders in the strategic content of the mes- 
sage. We approached this problem by filtering 
generated messages using a series of classifiers 
and checks to detect common issues. We outline 
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several of these filters here, with additional 
details in the SM, section D.3. 


Discriminating between human text 
and counterfactuals 


Much work has used adversarial or counter- 
factual examples to improve the robustness of 
natural language systems (30, 31). Following 
this approach, we generated many kinds of 
counterfactual messages that contained mis- 
takes that language models are prone to, in- 
cluding heuristically corrupted text as well as 
model-generated negatives. We trained a suite 
of 16 classifiers to discriminate between the 
ground-truth human message and different 
kinds of counterfactual messages (sometimes 
varying the random seed or context informa- 
tion available) and used these classifiers in an 
ensemble to filter messages. This approach 
risked overly filtering complex messages that 
contain precise plans and accepting bland 
messages, such as “ok,” which are unlikely to 
contain mistakes. However, we found that care- 
fully designing our ensemble allowed us to 
filter most nonsensical messages with mini- 
mal impact on message complexity: On a small 
evaluation set with 362 expert-annotated ex- 
amples, we found that we could detect 83% 
of nonsense messages, without substantial im- 
pact to message diversity as measured by the 
proxy of message length and the number of 
references to Diplomacy-specific entities (SM, 
section D.3.1). 


Intent correspondence 


As noted previously, controlling dialogue gen- 
eration through intents has the twofold benefit 
of improving the strategic value of a message 
and reducing discussion of impossible moves 
or other hallucinations. However, this control 
is imperfect, and the dialogue model may gen- 
erate messages that contradict the intents it 
conditions on. To address this, we filtered mes- 
sages that would reduce the likelihood of the 
actions in the intent. Evaluating this method 
on a small test set of 1013 expert-annotated 
messages, we achieved a recall of 65%, filtering 
24% of all messages (SM, section D.3.2). 


Value-based filtering 


Conditioning on intents can lead to “infor- 
mation leakage,” in which the agent reveals 
compromising information about its plan to 
an adversary (section Selecting intents during 
play). To mitigate this, we developed a method 
to score potential messages by their estimated 
value impact. We computed the piKL policies 
for all agents after each candidate message 
and filtered those that led to a lower expected 
value (EV) for Cicero playing its intended 
action. Expert evaluation on a set of 127 dia- 
logue scenarios demonstrated that accepted 
messages were preferred over filtered messages 
62% of the time (P < 0.05) (SM, section D.3.3). 
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Other filters 

We additionally deployed other filters—for ex- 
ample, to detect toxic language (SM, section 
D.3.4)—and heuristics to curb bad behaviors, 
including repetition and off-topic messages 
(SM, section D.3.5). 


Cicero in anonymous human play 

Cicero participated anonymously in 40 games of 
Diplomacy in a “blitz” league on webDiplomacy.net 
from 19 August to 13 October 2022. This league 
played with 5-min negotiation turns; these time 
controls allowed games to be completed within 
2 hours. Cicero ranked in the top 10% of par- 
ticipants who played more than one game and 
second out of 19 participants in the league that 
played five or more games. Across all 40 games, 
Cicero’s mean score was 25.8%, which was more 
than double the average score of 12.4% of its 
82 opponents. As part of the league, Cicero 
participated in an eight-game tournament that 
involved 21 participants, six of whom played 
at least five games. Participants could play a 
maximum of six games, with their rank deter- 
mined by the average of their best three games. 
Cicero placed first in this tournament. 

During games, players were not able to 
see the usernames of other players. Although 
webDiplomacy notifies users that the website 
has participated in AI research and that cer- 
tain game modes allow users to play with AI 
agents, we evaluated Cicero in games with 
humans in which the participants were not 
explicitly informed that they were playing 
with an AI agent for that particular game. 
Cicero’s participation as an AI was revealed to 
all players at the conclusion of the research 
(SM, section A.4). 


Discussion 


Cicero successfully combined strategic reason- 
ing and dialogue to cooperate and negotiate 
with humans on a complex task, achieving 
strong human-level performance in the game 
of Diplomacy. Furthermore, Cicero passed 
as a human player for 40 games of Diplomacy 
with 82 distinct players, and no in-game mes- 
sages indicated that players believed that 
they were playing with an AI agent. One 
player mentioned in post-game chat a suspi- 
cion that one of Cicero’s accounts might be 
a bot, but this did not lead to Cicero being 
detected as an AI agent by other players in 
the league. 

Two examples of coordination and negoti- 
ation are shown in Fig. 6. In the coordination 
example, we observed Cicero building an al- 
liance through discussion of a longer-term 
strategy. In the negotiation example, Cicero 
successfully changed the other player’s mind by 
proposing mutually beneficial moves. Despite 
dishonesty being commonplace in Diplomacy, 
we were able to achieve human-level perfor- 
mance by controlling the agent’s dialogue through 


the strategic reasoning module to be largely 
honest and helpful to its speaking partners. 

Although Cicero is shown to be effective at 
cooperating with humans, it occasionally sent 
messages that contained grounding errors, 
contradicted its plans, or were otherwise stra- 
tegically subpar. Although we reduced errors 
with a suite of filters, Diplomacy poses an in- 
teresting benchmark for studying this prob- 
lem. We suspect that these mistakes did not 
raise further suspicions that Cicero was an AI 
agent because of the time pressure imposed 
by the game, as well as because humans occa- 
sionally make similar mistakes. As such, formats 
of Diplomacy with longer negotiation periods 
could provide an even further challenge for 
future work because players typically engage 
in more detailed and complex negotiation in 
these formats. 

From a strategic perspective, Cicero reasoned 
about dialogue purely in terms of players’ ac- 
tions for the current turn. It did not model 
how its dialogue might affect the relationship 
with other players over the long-term course 
of a game. Considering this might allow it to 
deploy dialogue more strategically. Further- 
more, the expressive power of our intent rep- 
resentation limited Cicero’s ability to control 
richer affordances of dialogue such as strate- 
gically revealing information, asking questions, 
or providing explanations for its actions. There 
remain many open problems for intentional 
use of dialogue, and Diplomacy provides a rich 
testbed to explore these connections between 
strategy and communication, with the goal 
of improving coordination between humans 
and agents. 


Ethical considerations 


We discuss ethical considerations for this re- 
search further in the SM, including privacy 
considerations for data usage (SM, section A.1), 
potential harms resulting from toxic or biased 
language generation (SM, section A.2), avenues 
for misuse of goal-oriented dialogue technol- 
ogy (SM, section A.3), and AI agent disclosure 
to human players (SM, section A.4). 
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A new biologic paleoaltimetry indicating Late 
Miocene rapid uplift of northern Tibet Plateau 


Yuntfa Miao’?>*, Xiaomin Fang>*, Jimin Sun*, Wenjiao Xiao™”, Yongheng Yang’, Xuelian Wang**, 
Alex Farnsworth®, Kangyou Huang”*, Yulong Ren®, Fuli Wu°, Qingqing Qiao’, Weilin Zhang®, 


Qingquan Meng®, Xiaoli Yan®, Zhuo Zheng’, Chunh 


ui Song®, Torsten Utescher? 


The uplift of the Tibet Plateau (TP) during the Miocene is crucial to understanding the evolution of Asian 
monsoon regimes and alpine biodiversity. However, the northern Tibet Plateau (NTP) remains poorly 


investigated. We use pollen records of montane conife 


rs (Tsuga, Podocarpus, Abies, and Picea) as a new 


paleoaltimetry to construct two parallel midrange paleoelevation sequences in the NTP at 1332 + 189 m 


and 433 + 189 m, respectively, during the Middle Mioc: 


ene [~15 million years ago (Ma)]. Both midranges 


increased rapidly to 3685 + 87 m in the Late Miocene (~11 Ma) in the east, and to 3589 + 62 m at ~7 Ma in the 
west. Our estimated rises in the east and west parts of the NTP during 15 to 7 Ma, together with data from 
other TP regions, indicate that during the Late Miocene the entire plateau may have reached a high elevation 
close to that of today, with consequent impacts on atmospheric precipitation and alpine biodiversity. 


he Tibet Plateau (TP) is the world’s 
highest inhabited terrain, with an area 
>2.5x10° km? and an average elevation 
>4000 meters above sea level (m asl); it 
is also commonly referred to as the 
“Third Pole” or the “Asian Water Tower”. Its 
uplift is a direct expression of the geodynamic 
processes of Earth's interior caused by the 
collision of the India-Asia plates (7-3) and 
regarded as the main factor driving the evo- 
lution of the Asian monsoon regime (3-6) 
and alpine biodiversity (7-9). Despite con- 
siderable controversy (10, 11), the Miocene 
epoch is now deemed critical for under- 
standing the timing of the attainment of 


the highest paleoelevation of the TP (12-15), 
the development of the modern Asia mon- 
soon regime (3-6), and the acceleration of 
the evolution of alpine biodiversity (7, 8). 
However, compared with the abundant proxy 
data and paleoelevation reconstructions for 
the southern TP (e.g., based on mammalian 
ecology (16), plant fossils (15), sedimentary 
characteristics including isotope geochemistry 
(17), and tectonic collapse) (72), the northern 
part—despite being an integral part of the 
TP—remains poorly understood. 

Tectonic and sedimentary evidence has shown 
that the NTP has generally been increasing in 
height since the Middle-Late Miocene (18-23), 
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Fig. 1. Locations of sites used for pollen-based paleoelevation reconstruction 
in the NTP and the distribution of surface pollen samples in Asia. (A) 1-HTTL 
section, 2-YH section, 3-KC-1 core, and 4-SG-1 core used in this study and other 
sites referenced in the adjunct Qaidam Basin: XN (Xining) and ZK (Zeku); GDGTs 


but much older (Oligocene or Eocene) (24, 25) 
or younger (Late Pliocene) (26) uplifts of the 
region have recently been reported. There is 
also no consensus on when and how such 
rapid uplift began to form present range- 
basin configurations (18-20, 23, 25-27). Paleo- 
elevation reconstruction can provide key 
constraints to clarify this question. However, a 
few current estimations of the elevation his- 
tory of the NTP from isotope paleoelevation and 
biology indicate a high discrepancy (16, 27-30). 
Hydrogen isotope composition and glycerol 
dialkyl glycerol tetraethers (GDGTs)-based 
paleotemperature history derived from organic 
geochemistry demonstrated that the NTP expe- 
rienced ~2100 to 1000 m of rapid uplift during 
the Middle to Late Miocene (27-29), whereas 
the oxygen isotope data from paleosol carbon- 
ate nodules and Climate-Leaf Analysis Multi- 
variate Program (CLAMP) analysis of fossil leaves 
in the Qaidam Basin in the central NIP showed 
that the basin rapidly increased in elevation 
from ~2000 m at 52 to 44 Ma (3D) to its pres- 
ent elevation at ~3300 m at ~30.8 Ma (32). In 
addition to the difficulties resulting from large 
uncertainties rooted in use of isotope-based 
paleoelevation (0, 33, 34), biological evidence 
from fossil mammals, leaves, and pollen in the 
NTP revealed a very different uplift scenario, 
all demonstrating much lower elevations and 
an absence of high Mountains surrounding the 
Qaidam Basin during the Miocene (J6, 35). This 
inhibits a comprehensive understanding of the 
paleoelevation history of the entire TP and 
obscures its role in the evolution of atmospheric 
circulation (precipitation) or biodiversity. 
Montane conifers only grow in higher ele- 
vation regions of Earth’s surface (36, 37) and 


SCIENCE science.org 


Elevation (m asi} 


Longit ide 


consequently their growth and reproduction 
are sensitive to changes in elevation (38, 39). 
We focus on the pollen records of four conifer 
genera (Tsuga, Podocarpus, Abies, and Picea) 
with easily identifiable vesiculate palynomor- 
phology, obtained from sedimentary sequen- 
ces in the NTP. Our aims were to reconstruct 
two parallel elevation histories since 16 Ma, 
produce a geodynamical end-element model 
of uplift, and link the paleoelevation records 
from across the TP, to reveal the impacts of 
uplift on climatic and biological evolution. 

The Qaidam Basin covers an area of ~2.5 x 
10° km? with an average elevation of 3000 + 
200 m asl. It is surrounded by three mountain 
ranges: the Kunlun, Altyn Tagh, and Qilian, 
which have elevations of ~4000 to 5500 m asl 
(Fig. 1A). During the Cenozoic the basin accumu- 
lated enormous quantities of clastic sediments 
derived from the adjacent Mountains In the 
eastern basin, Site 1 [Huaitoutala (HTTL) sec- 
tion] (37°13' N, 96°43’ E, 2950 m asl) has mag- 
netostratigraphic ages of Middle Miocene to 
Pliocene (15.6 to 4.0 Ma) (78), whereas Site 2 
[Yahu (YH) section] (37°46’ N, 93°36’ E, 2838 m 
asl) is dated to the Pliocene to Early Pleistocene 
(5.3 to 1.8 Ma) (40). We collectively name these 
two sites East Qaidam. In the western basin, Site 3 
(core KC-1) (38°03' N, 91°45’ E, 2820 m asl), dated 
to 18 to 5 Ma (4D, and Site 4 (core SG-1 series) 
(38°24 N, 92°30’ E, 2900 m asl), dated to 7 to 
0 Ma (42), are contemporaneous parallel records, 
and we collectively name them West Qaidam 
(Fig. 1A). The maximum distance between these 
eastern and western sites is >400 km. 

At Site 1 we conducted pollen analyses on 
34 samples which almost met the statistical 
requirements of a total >300 palynomorphs 


lig 
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229 - 
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(branched glycerol dialkyl glycerol tetraethers); (B) Distributions of the four 
conifer genera and the 3088 surface pollen samples; (C) [Tsuga (%) + Podocarpus 
(%)] / [Tsuga (%) + Podocarpus (%) + Abies (%) + Picea (%)] (TP/TPAP) ratios 
across the Central East Asia. The base map is a Digital Elevation Model (DEM). 


[see Sections 1.1 and 1.2 in (43)]. Additionally, 
we used previously published pollen data for 
214, 48, and 707 samples from Sites 2 (40), 3 
(4D), and 4 (42), respectively, which were ana- 
lyzed by our methodology (Fig. 2, A to D). 
Tsuga, Podocarpus, Abies, and Picea are 
four montane conifer genera represented in 
the pollen assemblages. Although influenced 
by factors such as precipitation, geographical 
location, and soil properties among others, 
these tree taxa have elevation-specific habitats 
in Asia (Figs. 1B and 3A): Tsuga and Podocarpus 
grow at middle elevations (300 to 3500 and 
600 to 2000 m asl, respectively) whereas Abies 
and Picea grow at higher elevations (500 to 
4700 and 300 to 4700 m asl, respectively) 
(36, 37). Therefore, the representation of these 
genera in pollen assemblages can be used to 
reconstruct paleoelevation. To do this, we first 
calculated the ratios of [Tswga (%) + Podocarpus 
(%)] to [Tsuga (%) + Podocarpus (%) + Abies 
(%) + Picea (%)] (abbreviated to the TP/TPAP 
ratio) at 1-Ma intervals to reduce the influence 
of differences in sample resolution and to eval- 
uate their relationships with climatic fluctua- 
tions. The results show that both TP/TPAP 
records decrease continuously, with the fitting 
of third-degree polynomial curves resulting in 
coefficient of determination (R”) values of 0.94 
and 0.90 for East and West Qaidam, respec- 
tively (fig. S3C). We also selected steppe taxa 
(e.g., Ephedra, Nitraria, and Amaranthaceae) 
to represent changes in dryness. The results 
show that in East Qaidam the minimal steppe 
contents occurred at ~8 Ma, whereas in West 
Qaidam there was a continuous intensifica- 
tion of aridification (Fig. 2E). Notably, the TP/ 
TPAP ratios are inconsistent with the records 
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Fig. 2. Lithology, ages, and 
simplified pollen assem- 
blages. (A) and (B) Sites 1 
and 2 in East Qaidam, respec- 
tively. (C) and (D) Sites 3 
and 4 in West Qaidam, respec- 
tively. Site 1 is a new pollen 
study and sites 2 to 4 are 
edrawn from references 
(40-42). (E) Steppe taxa per- 
centages and (F) TP/TPAP 
atios for East and West 
Qaidam at 1-Ma intervals. 
Green leaf symbol, plant fossil 
horizon discovered in this 
study; fish symbol, horizon of 
the Cyprinid Hsianwenia wui 
eferenced in this study (46). 
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of tetraether temperatures of soil and lacus- 
trine sediments from East Qaidam (28), which 
are based on 9 soil and 15 lacustrine facies 
samples, with an average temperature devia- 
tion between them of 4.3 to 4.6°C, and which 
reveals a complicated pattern of variation (fig. 
83, D to E). Furthermore, the TP/TPAP ratios 
also differ from that of the global surface tem- 
perature (44), which shows a lower amplitude 
of change weaker than that of the TP/TPAP 
ratios, especially during 16 to 8 Ma (figs. $3, D 
and E, and S10). 


Paleoelevation verification and results 


After evaluating the influences of atmospheric 
and fluvial dynamics, precipitation, tempera- 
ture, and CO, on the TP/TPAP [Sections 1.5 and 
1.6 in (43)], the modern TP/TPAP-elevation 
relationship is the foundation of our eleva- 
tion reconstruction, and our methodology is 
as follows: First, we determined the modern 
spatial distribution of the TP/TPAP ratios, 
based on 3088 surface pollen samples from 
Asia. The results indicated that most of the 
samples with high values are from the lower 
mountain areas in southeastern Asia (Fig. 1C). 
Second, we determined linear relationships 
between the TP/TPAP ratio and elevation. 
The corresponding R? value reached 0.90, after 
correcting the samples to a specific latitude 
(here, 37.2° N), and then calculating the arith- 
metic mean for elevation bins at 200-m inter- 
vals [Section 1.4 in (43), and Fig. 3B]. Third, to 
test the reliability of the relationships, we cal- 
culated the elevations of a series of pollen sites 
from across the TP (the Himalayas and its 
deep interior) (five sites of Quaternary age 
and five sites of Miocene-Pliocene age), to- 
gether with one site in Japan near sea level 
[Section 1.9 in (43)]. The results for the younger 
interval are consistent with the middle altitu- 
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dinal range (midrange) of the drainage (R? = 
0.80) (table S7). Additionally, the results for 
the older interval, after correction for the tem- 
perature difference (Tp, paleo and modern at 
sea level, table S5) and the terrestrial temper- 
ature lapse rates (TLR) (table S6) using more 
reliable wet-bulb derived terrestrial lapse rates 
(45) [Section 1.7 in (43)], are well matched 
with records based on isotopes, fossil species 
with the nearest living relative, and on the 
CLAMP (table S11). In the NTP, our recon- 
structed paleoelevation record shows that the 
parallel midrange paleoelevation of the area 
adjacent to East Qaidam rose rapidly, from 
1332 + 189 to 3685 + 87 m asl during the in- 
terval of ~15 to ~11 Ma (an increase of ~2400 m) 
(Fig. 4A and table S9). The region adjacent 
to West Qaidam also rose rapidly, from 433 + 
189 to 3589 + 62 m asl during the interval from 
~15 to ~7 Ma (an increase of ~3200 m) (Fig. 4A 
and table S9). Fourth, to determine the basin 
and mountain elevations separately, we also 
estimated the basin paleoelevation of the 
Qaidam Basin using the pollen-based Co- 
existence Approach during 16 to 14 Ma and 
the plant fossil-based CLAMP during ~11 Ma, 
which indicate basin paleoelevations of 1102 + 
429 and 2355 + 703 m asl, respectively (Fig. 4, 
B and C). We also cite an estimate of the basin’s 
paleoelevation of ~2362 + 62 m asl, based on 
the occurrence of the Cyprinid Hsianwenia 
qwui gen. et sp. nov. from the YH section (Site 2) 
(Late Miocene to earliest Pliocene) (46, 47) 
[Sections 1.8 and 1.9 in (43)], indicating a 
~1300 m increase. Thus, we estimate that the 
montane paleoelevation of the region adjacent 
to East Qaidam increased from ~1400 to 
4200 m asl, and then to 5023 m asl at ~15, 
~11, and ~7 Ma, respectively, corresponding to 
respective increases of ~2800 and ~800 m. 
In summary, the NTP may have experienced 
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very strong uplifts during the Middle to Late 
Miocene and attained an elevation similar to 
its present one during the Late Miocene (8 to 
6 Ma). 


Rise of the NTP 


Relatively few paleoelevation reconstructions 
exist for the basin and Mountains in the NTP, 
and the available reconstructions have large 
potential uncertainties depending on the dif- 
ferent proxies and methods used (28, 29, 48). 
Tetraether temperatures revealed a relative- 
ly substantial relief (>700 to 1100 m) between 
the Qaidam Basin and the Qilian Mountains 
since ~12.4 Ma (28). A multidisciplinary ap- 
proach incorporating regional zircon geochro- 
nology indicated that the Jishi Mountains, to 
the southeast of the Qilian Mountains, were at 
a low elevation during 14 to 9 Ma and then 
rose abruptly at ~8 Ma to reach an elevation 
sufficient to block the passage of water vapor 
(49). Tectonically, there was the synchronous 
emergence and rapid erosion of the Qilian 
Mountains at ~12 Ma (28). Thermodynamics- 
based evidence also exists for the onset of the 
rapid exhumation of the Qilian Mountains at 
~10 Ma (50). Our results have quantified the 
paleoelevation of most of the crustal shorten- 
ing across the Qilian Mountains that accumu- 
lated during the Late Middle to Late Miocene. 

For the basin elevation, leaf wax hydrogen 
isotopes (28) indicate that the elevation of the 
Qaidam Basin increased by ~2100 m during 15 
to 10 Ma—almost twice that indicated by our 
results (a rise of ~1100 m). At the same time, 
the elevation of the Xining Basin to the east 
increased rapidly by ~1000 m at ~10 to 8 Ma, 
according to a record of GDGTs (29). Since the 
present-day elevation of the Xining Basin is 
~2200 m asl, the paleoelevation of the basin 
should have been <1200 m asl at 10 to 8 Ma. 
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Fig. 3. Elevation of the studied conifer genera corrected for latitude and pollen results versus corrected elevation. (A) Sketch map of genera elevations at 
a specific latitude and results corrected for latitude (arrows show the directions of latitudinal correction). (B) Correlation between the TP/TPAP ratios and the 
elevation at 200 m intervals, after correcting the surface pollen samples to the latitude of 37.2° N (light green shaded area is the 95% confidence interval). (C) Sample 
numbers of surface pollen assemblages for each 200-m height interval. 


Fig. 4. NTP uplift and impact on A 
precipitation. (A) Pollen-based 8 
; >) 
paleoelevation curves at 1-Ma g + 
intervals with standard error of the = 
mean error bars (table S9), with =i S 
< 6° 
terrestrial temperature lapse rates = 7 
(twp) of 3.1 to 4.0 °C 1000 mt 3 
(table S6), and green and blue lines 2 8 
show the LOESS Fit (Span = 0.5). g = 
Purple line shows a cumulative oa 


curve of tectonic event numbers 

(CCTE) in the NTP (north of the 

Kunlun Suture). (B) and (C) Illustra- 

tions of the distribution of conifers 

versus elevation for two time slices, 

after temperature correction. The 

black brace and arrow show the B 
midrange of the drainage and 

the pollen transport direction, 
respectively. The cyan shading 
indicates rivers and lakes within 
the lowest part of the basin. 

(D) RegCM4.6 model results 
showing the percentage precipitation 
anomaly after reducing the 

elevation of the NTP by 67% 
[Section 1.11 in (43)]. The red line 
shows an outline of the entire TP, 
and the dashed lines indicate the 
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boundaries of the NTP, Himalayas 
(HY), and Hengduan Mountains (HD). 


Additionally, the elevation of the Zeku Basin 
(see Fig. 1 for location, present day eleva- 
tion 3440 to 4300 m asl) during the Middle 
Miocene is speculated to have been 1200 to 
1400 m asl (57) or <2400 m asl (52). Combin- 
ing this evidence, we conclude that the basins 
in the NTP were at lower elevations before the 
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Late Miocene and were then intensely uplifted 
until 8 to 6 Ma. 


Maximum elevation of the entire TP 


In the southern TP, there are several north- 
south rifts and normal faults (fig. S17), includ- 
ing Leo Pargil (15.5 to 14.5 Ma) (53), Thakkhola 
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Hydrothermal Muscovite (~14 Ma) (12), Shuanghu 
Graben (~13.5 Ma) (13), and Yangbajing Graben 
(8 Ma) (3). Additionally, stable isotope evidence 
indicates the following elevations and ages: 
Namling-Oiyug Basin (~15 Ma, 5200 + 1400 to 
600 m) (54) or (~5400 m) (55), Mount Everest 
(15 Ma, 5100 to 5400 m) (56), Zhada Basin 
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(9.2 Ma, ~5500 to 6500 m) (14), Thakkhola (11 
to 7 Ma, ~5500 m) (57), and the Gyirong Basin 
(10 Ma, 5800 m) (58). These estimates are sup- 
ported by our pollen-based results, which to- 
gether indicate that the elevation of the entire 
TP during the late Middle to early Late Miocene 
may have been similar to that of today. 

These records are consistent with the con- 
clusion that east-west extension caused the 
elevation of the Kailas and Qiabulin areas in 
the southern TP to decrease by ~1000 m in the 
Late Miocene (59) whereas the elevation of the 
Songpan Basin in the eastern TP increased to 
a height similar to its present day elevation by 
11 Ma (60). Molnar et al. (61) proposed a “con- 
vective stripping model” to explain the uplift 
and east-west extension of the plateau, based 
on the development of north-south normal 
faults, which was subsequently supported by 
seismic results (62). Overall, the rapid uplift 
of the Qaidam Basin and Qilian Mountains in 
the Middle to early Late Miocene was almost 
simultaneous with the east-west extension of 
the rifting in the central-southern TP (67) and 
other coeval tectonic events within and around 
the TP [a summary is given in (25)]. These 
findings support the end-member model of 
the removal of mantle lithosphere beneath 
different portions of the TP (3, 25, 67). 


Impact of uplift on climate and 
biodiversity changes 


We propose that the entire TP may have reached 
its maximum elevation and spatial extent during 
the Late Miocene, which would have affected 
regional climatic and biological evolution 
through physical blocking and thermodynam- 
ic effects on the transport of atmospheric water 
vapor and the resulting precipitation (3, 6, 63). 
To quantitatively evaluate the precipitation 
variability in the TP, we ran the Regional Cli- 
mate Model (RegCM4.6) with a 30-km resolu- 
tion, reducing the elevation of the NTP to 33% 
in a sensitivity test (similar to the scenario of a 
low TP during the Middle Miocene, called exp. 
LM). By contrast, a control test was run which 
maintained the present-day topography of 
the TP, called exp. HP [Section 1.11 in (43)]. The 
results show that, in contrast to exp. HP, the 
annual precipitation in the NTP was reduced 
by >50% in exp. LM (Fig. 4D) and was slightly 
higher in the spring and summer than in 
the autumn and winter (fig. S21). However, 
in the Himalayas and Hengduan Mountains 
to the south and southeast, the precipitation 
increased by 50% and 150%, respectively (Fig. 
4D). These precipitation changes are supported 
by the results of the RegCM4.1 (64), the Com- 
munity Atmosphere Model (CAMA4) (65), and 
the Weather Research and Forecasting model, 
nested within CAM4 (66), which was used to 
model the removal of the NTP as well as other 
marginal regions surrounding the TP. The 
principal reasons for the precipitation changes 
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are thought to be the strengthening of the East 
Asian summer monsoon circulation (65) and 
a decrease in local precipitation in the Asian 
interior, driven by the enhanced rain shadow 
effect of the Mountains and changes in re- 
gional circulation (64). Progressive aridifica- 
tion in the west is evidenced by the increase 
in steppe taxa in pollen assemblages (Fig. 2E) 
and occurrence of the unusual thick-boned 
Cyprinid, Hsianwenia wui (46) during the Late 
Miocene to Early Pliocene, whereas in the east 
a wetting-drying transition occurred at ~8 Ma 
(Fig. 2E). Despite this spatial inconsistency, 
both Podocarpus and Tsuga flourished in areas 
of low topography and then decreased slightly 
until 8 to 6 Ma, although Picea and Abies also 
flourished in the east (Fig. 2B) and decreased 
in the west (Fig. 2D), and the trends of the TP/ 
TPAP ratios are synchronous (Fig. 2F). 

The pronounced precipitation increase in the 
Himalayas and Hengduan Mountains is also 
correlative with increases in the diversity of 
vegetation (8) and reptiles (7) in these regions 
during the Middle to Late Miocene, against 
the background of a high TP. Thus, we sug- 
gest that these increases in biodiversity were 
driven jointly by mountain building and in- 
tensification of the Asian monsoon (8, 67) and 
that overall, the uplift of the NTP can be seen 
to have contributed substantially to climate 
change and biological evolution in Asia. 

We used two parallel montane conifer 
pollen records from the NTP to reconstruct 
paleoelevation records for the interval since 
the Middle Miocene. Our results clearly show 
that the high elevation in the NTP was at- 
tained at ~10 to 7 Ma, supporting the geo- 
dynamics model of the removal of mantle 
lithosphere beneath different portions of the 
TP. The rise of the NTP during the Miocene 
may also have enhanced the role of top- 
ography in accelerating the rate of change of 
regional biodiversity through the influence of 
topography on precipitation. 
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Rapid weed adaptation and range expansion in 
response to agriculture over the past two centuries 


Julia M. Kreiner'*, Sergio M. Latorre**, Hernan A. Burbano*“, John R. Stinchcombe®, 


Sarah P. Otto~®, Detlef Weigel*, Stephen I. Wright® 


North America has experienced a massive increase in cropland use since 1800, accompanied more 
recently by the intensification of agricultural practices. Through genome analysis of present-day and 
historical samples spanning environments over the past two centuries, we studied the effect of 

these changes in farming on the extent and tempo of evolution across the native range of the common 
waterhemp (Amaranthus tuberculatus), a now pervasive agricultural weed. Modern agriculture has 
imposed strengths of selection rarely observed in the wild, with notable shifts in allele frequency 
trajectories since agricultural intensification in the 1960s. An evolutionary response to this extreme 
selection was facilitated by a concurrent human-mediated range shift. By reshaping genome-wide 
diversity across the landscape, agriculture has driven the success of this weed in the 21st century. 


gricultural practices across North America 

have rapidly intensified over the past 

two centuries through cropland expansion 

(1), monoculture plantings (2, 3), and 

increased chemical inputs (4, 5). Since 
the beginning of the 1800s, cropland usage 
has expanded from 8 million to 200 million 
hectares in Canada and the United States 
alone (1). Since the mid-1900s, development 
of new crop varieties—including high-yield 
and herbicide-resistant wheat, corn, and soy 
(6, 7)—has greatly improved the efficiency of 
food production in all farming sectors. Com- 
bined with increased reliance on pesticides, 
fertilizers, irrigation, and large-scale mecha- 
nization, this global transformation is often 
referred to as the agricultural Green Revolution 
(8-10). Pesticide effectiveness, however, has 
been limited by the evolution of resistance 
across numerous pest species (77-14). Although 
technological innovation for efficient food pro- 
duction has risen with increasing global food 
demands, the concomitant landscape conver- 
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sion has become one of the foremost drivers 
of global biodiversity loss (75). 

Species that have managed to survive, and 
even thrive, in the face of such extreme envi- 
ronmental change provide notable examples of 
rapid adaptation on contemporary time scales 
and illustrate the evolutionary consequences 
of anthropogenic change. One such species 
is the common waterhemp (Amaranthus 
tuberculatus), an agricultural weed that is 
native to North America and persists in large 
part in natural, riparian habitats (16, 17), which 
provides an opportunity to investigate the time 
scale and extent of contemporary agricultural 
adaptation. The genetic changes underlying 
weediness are particularly important to under- 
stand in A. tuberculatus because it has recently 
become one of the most problematic agricul- 
tural weeds in North America as a result of its 
widespread adaptation to herbicides, its per- 
sistence in fields across seasons, and its strong 
ability to compete with both soy and corn 
(18, 19). Determining the roles of newly arisen 
mutations, genetic variants predating the onset 
of environmental change (20, 27), migration 
across the range (22), and their interactions 
(23, 24) will inform the temporal and spatial 
scales at which contemporary adaptation occurs 
and management strategies should be used. 

To understand how changing agricultural 
practices have shaped the success of a ubiq- 
uitous weed, we analyze genomic data from 


contemporary paired natural and agricultural 
populations alongside historical herbarium 
samples collected from 1828 until 2011 (Fig. 1). 
With this design, we identify candidate agri- 
culturally adaptive alleles (i.e., those that occur 
at consistently higher frequencies in agricul- 
tural compared with nearby natural sites), track 
their frequencies across nearly two centuries, 
and link the tempo of weed adaptation to 
demographic changes and key cultural shifts 
in modern agriculture. 


The genome-wide signatures of 
agricultural adaptation 


To find alleles favored under current farming 
practices, we looked for those that were con- 
sistently overrepresented in extant populations 
collected in agricultural habitats compared with 
neighboring riparian (natural) habitats (25) 
using Cochran-Mantel-Haenszel (CMH) tests 
(Fig. 2A). Alleles associated with agricultural 
environments [the 0.1% of single-nucleotide 
polymorphisms (SNPs) with the lowest CMH 
P values; n = 7264] are significantly enriched 
for 29 gene ontology (GO) biological process 
terms related to growth and development; 
reproduction; cellular metabolic processes; 
and responses to abiotic, endogenous, and ex- 
ternal stimuli, including responses to chem- 
icals (table S1). The importance of chemical 
inputs in shaping weed agricultural adapta- 
tion is clear in that the most significant agri- 
culturally associated SNP {raw P value = 8.6 x 
10"; [false discovery rate (FDR)-corrected] 
Q value = 0.00062} falls just 80 kb outside the 
gene protoporphyrinogen oxidase (PPO)—the 
target of PPO-inhibiting herbicides (Fig. 2B). 
PPO-inhibiting herbicides were widely used in 
the 1990s and have had a recent resurgence 
to control and slow the spread of glyphosate- 
resistant weeds (26, 27). Other genes with the 
strongest agricultural associations include 
ACO1, which has been shown to confer oxi- 
dative stress tolerance (28); HB13, involved 
in pollen viability (29) as well as drought and 
salt tolerance (30); PME3, involved in growth 
through germination timing (37); CAMI, a regu- 
lator of senescence in response to stress (32, 33); 
and both CRY2 and CPD, two key regulators of 
photomorphogenesis and flowering through 
brassinosteroid signaling (34-37) (table S2). 
These changes are consistent with agricultural 
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Fig. 1. Sequenced waterhemp collections through space and time. (A) Map of 17 contemporary paired natural-agricultural populations [n = 187, collected and 
sequenced by Kreiner et al. (25)] along with 108 newly sequenced herbarium specimens dating back to 1828 collected across three environment types: agricultural 
(Ag), natural (Nat), and disturbed (Dist) (metadata provided in data S2). (B) Distribution of sequenced herbarium samples through time. 


selection to grow in high-stress and high- 
disturbance environments among fast-growing 
crops. Natural-versus-agricultural allele fre- 
quency differentiation (F'sy) is highly corre- 
lated with the CMH test statistic (Pearson’s 
correlation coefficient r = 0.987), with 78% (98%) 
of CMH focal SNPs overlapping with the top 
0.01% (0.1%) of Fsz hits (fig. S1). Despite neg- 
ligible genome-wide differentiation among en- 
vironments suggesting widespread gene flow 
(Fsr = 0.0008; with even lower mean Fr be- 
tween paired sites = —0.0029; Fig. 2C), our re- 
sults suggest that strong antagonistic selection 
acts to maintain spatial differentiation for par- 
ticular alleles, with 403 SNPs showing a CMH 
Q < 0.10 (251 after aggregating linked SNPs). 
To further investigate the extent to which 
herbicides shape adaptation to agriculture, 
we assayed patterns of environmental dif- 
ferentiation by known herbicide-resistance 
variants. Eight such alleles were present in 
contemporary samples, only six of which were 
common (table S3): a deletion of codon 210 
within PPO (38), a copy number amplifica- 
tion and a nonsynonymous mutation within 
5-enolpyruvylshikimate-3-phosphate synthase 
(EPSPS) conferring resistance to glyphosate her- 
bicides (39), and 3 separate nonsynonymous 
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mutations within acetolactate synthase (ALS) 
conferring resistance to ALS-inhibiting herbi- 
cides (79). Although these resistance alleles 
were at intermediate frequencies in agricul- 
tural populations, ranging from 0.08 to 0.35, 
they tended to be rarer but still frequent in 
natural populations, ranging from 0.04 to 0.22 
(Fig. 2C). Four of six common resistance alleles 
show significant allele frequency differences 
across environments (EPSPSamp: F = 9.02, 
P = 0.008; PPO210: F = 53.39, P = 1.04. 10"; 
ALS574: F = 4.95, P = 0.028; ALS376: F = 4.37, 
P = 0.038), two of which are among the strongest 
signals of differentiation genome-wide. Natural- 
versus-agricultural F's; at the PPO210 deletion, 
0.21, is higher than anywhere else in the ge- 
nome and is even stronger when calculated 
within population pairs (Fr = 0.27) (Fig. 2C). 
Similarly, the EPSPS amplification is ranked 
20th among genome-wide biallelic Fs; values 
at 0.14 (within-pair Fs; = 0.22), in support of 
herbicides as a foremost driver of agricultural 
adaptation (Fig. 2D). 

To infer the importance of selective trade- 
offs in adaptation across natural and agricultural 
environments, we implemented a Wright-Fisher 
allele frequency-based migration-selection bal- 
ance model for these four differentiated resist- 


ance alleles as well as the top 30 independent 
CMH outliers. Assuming that these alleles 
are at a steady state between migration and 
selection, we inferred that the costs of resist- 
ance per migrant that has arrived into natural 
environments are consistently higher than the 
benefits of resistance per migrant that has 
arrived into agricultural environments (per- 
migrant cost-to-benefit ratio ranges from 1.39 
for EPSPSamp to 5.03 for the PPO210 dele- 
tion; Fig. 2D and table S3). Thus, the spread of 
these four common herbicide-resistance alleles 
appears to be constrained either by more con- 
sistent selection against resistance in herbicide- 
free, natural environments or by particularly 
high rates of migration of susceptible alleles 
from natural into agricultural environments. In 
comparison, for the top 30 independent CMH 
outliers, the costs per migrant that has arrived 
in natural environments were about equally 
likely to be stronger or weaker (12/28, 42%) 
than the benefits per migrant in agricultural 
environments (fig. $2). This population genetic 
inference provides a previously unused and 
sensitive alternative to experimental studies of 
fitness costs that vary greatly depending on 
context (40), highlighting the potentially im- 
portant role of resistance costs across a diverse 


science.org SCIENCE 


RESEARCH | RESEARCH ARTICLES 


A oat 
@ 
210 
w 
v 
nae 
x 
= 
Oo 
o 
= 
7] 
Ss 
B= D ALS376" ALSS74" ALS653 _ EPSPS106 EPSPSamp*PPOdel* 
5 0 100 AFA=0.05 Faat.27 AFA 
= © Bat. CBa5. 
> _ o 
a’ Be | 
x 5B 
= Sc > ao om 
oO o@ 
= © 
oS 25 c= 
2 ° 


Genomic Position (Mb) 


Fig. 2. Signals of contemporary agricultural adaptation, gene flow, and 
antagonistic selection across the genome in A. tuberculatus. (A) Results 
from CMH tests for SNPs with consistent differentiation among environments 
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gene ALS. (C) Distribution of Fs; values between all agricultural and natural 
samples for ~3 million genome-wide SNPs (minor allele frequency > 0.05). 
Vertical lines indicate Fs values for the 10 candidate genes named in (A). 


across contemporary natural-agricultural population pairs. A 10% FDR (D) Population-level frequencies of six common herbicide-resistance alleles 


threshold is indicated by the lower dashed horizontal black line, whereas 
the Bonferroni-corrected P < 0.1 cut-off is shown by the upper dashed 


across geographically paired agricultural and natural habitats sampled 
in 2018 (pairs connected by horizontal lines). The first four columns are 


horizontal gray line. Red points indicate focal agricultural-associated SNPs nonsynonymous variants in ALS and EPSPS followed by EPSPSamp (a 10-Mb- 
after aggregating linked variation (r? > 0.25 within 1 Mb). Candidate scale amplification that includes EPSPS) and an in-frame single-codon 


agriculturally adaptive genes for peaks that are significant at a 10% FDR 
threshold are named. (B) CMH results from the scaffold containing the most 


deletion in PPO. Estimates of per-migrant natural cost-to-agricultural benefit 
ratio (C:B) are shown in the top right corner for the four resistance 


significant CMH P value, corresponding to variants linked to the PPO210 alleles with significant (indicated by asterisks) allele frequency differences 


deletion conferring herbicide resistance and to the nearby herbicide-targeted 


set of individuals within complex agricultural 
and natural environments. In these field set- 
tings, further work is necessary to understand 
the contributions of temporal and spatial het- 
erogeneity in both migration and selection for 
and against resistance across the landscape. 


Agriculturally adaptive alleles change rapidly 


With the genome-wide set of 251 modern 
agriculture-associated alleles, we searched for 
signatures of temporal evolution using newly 
collected whole-genome sequence data from a 
set of historical herbarium samples (n = 108) 
dating back to 1828. These samples provide 
snapshots of the genetic changes that have 
occurred over this time period and across envi- 
ronment types, with collections from natural 
and weedy (agricultural and disturbed) hab- 
itats (Fig. 1). Of the 165 loci for which we had 
sufficient information in the historical SNP 
set (sequenced to 10x coverage, on average), 
151 were segregating with the same reference- 
alternate allele combination (i.e., 11 were 
dropped because of multiallelism), and only 
three were invariant. To model allele frequency 
change through time at these alleles, we im- 
plemented logistic regressions of genotypes 
(within-individual allele frequencies) at each 
locus by collection year, where twice the slope 
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of the logit-transform is equivalent to the 
strength of selection (s) in a diploid model of 
selection [where s is the fitness difference be- 
tween homozygotes, assuming additivity; see 
materials and methods for model and simu- 
lations (41)]. 

Consistent with the rapid change in land 
use and farming practices in the recent past, 
the frequency of these 154 contemporary agri- 
cultural alleles has increased substantially over 
the past two centuries. Whereas in natural 
environments agriculturally associated alleles 
have increased by 6% on average since 1870— 
the earliest time point at which we have col- 
lections across environment types—these same 
alleles have increased by 22% in disturbed and 
agricultural environments (Fig. 3A). This ob- 
served change greatly exceeds the expected 
change over this time period, based on genome- 
wide patterns that reflect drift, migration, se- 
lection, and demographic change [null 95% 
interquantile range for allele frequency change 
in natural sites = (-2.7, 2.0.%); for change in 
agricultural and disturbed sites = (3.3, 7.9%)]. 
We generated these null expectations by ran- 
domly sampling a set of 154 loci with the same 
distribution of contemporary allele frequencies 
(fig. S4) and calculating their frequency change 
through time across herbarium samples, sep- 


(AFA) across environment types in a multiple linear regression. 


arately in each environment, 1000 times [see 
materials and methods (47)]. That the observed 
change in natural environments is also more 
extreme than what is expected is consistent 
with ongoing migration of agriculturally se- 
lected alleles and subsequent costs in natural 
environments. 

The considerable increase in frequency of 
these alleles across environments corresponds 
to notably strong selection, even when esti- 
mated over century-long time periods. The 
154 agriculture-associated alleles collectively 
exhibit a selective strength of s = 0.022 since 
the 1870s in agricultural and disturbed habi- 
tats. However, these alleles exhibit much weaker 
selection, § = 0.0056, in natural habitats [agri- 
cultural and disturbed null interquantile range = 
(0.0026, 0.0068); natural null interquantile 
range = (-0.0018, 0.0018)]. An open question 
in evolutionary biology is what distribution 
of selection coefficients underlie adaptation 
(42). We estimate that selection on agricultural- 
associated loci varies between —0.196 and 0.150 
in natural habitats and between -0.090 and 
0.372 in agricultural and disturbed habitats, 
reflective of left- and right-skewed distribu- 
tions, respectively (Fig. 3B and fig. S5). The 
top 15 agriculture-associated alleles that we 
infer have experienced the strongest selection 
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Fig. 3. Genomic signatures of agricultural adaptation through time. 

(A) Agricultural allele frequency trajectories for each of the 154 focal SNPs 
in agricultural and disturbed habitats (left) and in natural habitats (right). 
Trajectories are colored by the empirical range of the allele frequency change 
quantile in agricultural and disturbed habitats. Transparent lines indicate 
those with nonsignificant evidence of selection at a = 0.05 after FDR = 10% 
correction. (B) The distribution of selective strengths on agricultural alleles 
in natural (dark gray) and agricultural and disturbed (light gray) habitats 


(over both loci and individuals) of allele frequencies, whereas dots represent 
raw genotype data for each locus and sample from which the allele frequency 
trajectory is estimated. Cropland use per capita in North America data 

are from (1), reflecting the intensity of agricultural practices. (D) The 
trajectory of alleles at known herbicide-resistance loci through time, fit by 
logistic regression for each of the biallelic resistance alleles present in our 
contemporary data (excluding EPSPSamp with its complex allelic structure). 
Dots represent genotypes for each historical and contemporary sample at 


over the past ~150 years include SNPs that map 
near PPO, ACO1, CCB2, WRKY13, BPL3, and 
ATPD (table S4). We find that both the total 
frequency change of agriculture-associated 
alleles and the estimated strength of selection 
in agricultural and disturbed environments 
are positively correlated with the extent of 
contemporary linkage disequilibrium around 
these loci (the number of SNPs within 1 Mb 
with r* > 0.25) (frequency change: F = 5.16, P = 
0.024, 7 = 0.12; strength of selection: F = 3.99, 
P = 0.048, r = 0.058; fig. S6), consistent with 
theoretical expectations for the genomic sig- 
natures of recent positive selection (43, 44). 
We next investigated how well the trajectory 
of modern agricultural alleles reflects the rise 
of industrialized agricultural regimes over the 
past century. When we split our samples into 
those that predate versus those that come after 
the intensification of agriculture during the 
Green Revolution, we find that the increase in 
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between 1870 and 2018. (C) Environment-specific agricultural allele 
frequency trajectories before and after the start of agricultural intensification 
n 1960 (vertical dashed line). Large circles represent moving averages 


each resistance allele. 


frequency of agricultural alleles was negligible 
in agricultural and disturbed environments be- 
fore the 1960s (predicted 1870-to-1960 change = 
0.005). By contrast, change subsequent to 1960 
nearly completely accounts for the observed 
rise in frequency of modern agricultural alleles 
(predicted 1960-to-2018 change = 0.219 versus 
total 1870-to-2018 change = 0.221) (Fig. 3C). 
Corresponding estimates of selection by logistic 
regression using only data from before 1960 
show no evidence of selection on these loci in 
disturbed and agricultural habitats [s = 0.0008, 
null interquantile range = (—0.0044, 0.0020)] 
or in natural habitats [s = 0.0006, null inter- 
quantile range = (-0.004, 0.004)]. However, 
samples collected after 1960 reflect a marked 
shift in selection—a collective s = 0.054 in 
disturbed and agricultural environments and 
a collective s = 0.028 in natural environments 
[agricultural and disturbed null interquantile 
range = (0.0064, 0.0020); natural null inter- 


each herbicide-resistance locus. The 95% Cls of the maximum likelihood 
estimates of selection between 1960 and 2018 are provided in the legend for 


quantile range = (—0.0056, 0.0054)] (Fig. 3C 
and fig. S8). Together, these results suggest 
that although most contemporary agricultural 
alleles were present in historical populations, 
these alleles only became associated with agri- 
cultural and human-managed sites over the past 
century, on time scales and at rates consistent 
with the rapid uptake and intensification of 
agrochemicals, controlled irrigation, and mech- 
anization in agriculture. 

The historical trajectory of known herbicide- 
resistance alleles epitomizes extreme selec- 
tion over the past 50 years (Fig. 3D). Five of 
seven known biallelic herbicide-resistance 
alleles present in our contemporary, paired- 
environment collections are absent from our 
historical samples, consistent with the sug- 
gested importance of resistance adaptation 
from de novo mutation (13, 45) and a particu- 
larly recent increase in their frequency. Only 3 
of 108 historical samples show variation for 
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Fig. 4. Temporal shifts in the distribution of var. rudis ancestry have 
facilitated polygenic agricultural adaptation. (A) Longitudinal clines in var. 
rudis ancestry over three time spans, illustrating the expansion of var. rudis 
ancestry eastward over the past two centuries. In (A) to (C), dots represent 
individual-level ancestry estimates. (B) The distribution of individual-level 
var. rudis ancestry by state and through time, illustrating state-specific 


changes in ancestry. Horizonal lines within each distri 


herbicide resistance—two samples homozygous 
for resistance at ALS574 and one heterozygous 
for resistance at ALS122—all of which were 
sampled after the onset of herbicide applica- 
tions in the 1960s (Fig. 3D). Resolving the very 
low historical and much higher contemporary 
frequencies of resistance, we estimate that 
since the approximate onset of herbicide use 
in 1960, these seven resistance alleles have 
collectively experienced a selective strength 
of § = 0.198 (logistic Z value = 2.11, P = 0.035) 
per year across environment types. Maximum 
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100 kb average 


bution represent first and 


likelihood-based estimates of selective strengths 
for each resistance allele are significant for five 
of the seven and are strongest for PPO210 (s > 
0.194), EPSPS106 (s > 0.106), and ALS574 (s > 
0.088) (Fig. 3D and table S3). 


Concurrent temporal shifts in ancestry 
underlie agricultural adaptation 


Finally, we explored whether historical de- 
mographic change over the past two centuries 
has played a role in agricultural adaptation. 


Early taxonomy described two different 


100kb average 


third quartiles of ancestry. (C) Increasing sorting of individual-level var. rudis 
ancestry into agricultural environments (Env.) on contemporary time scales. 
(D) Environment-specific metrics of selection [CMH P value and cross-population 
extended haplotype homozygosity (XPEHH)] across the genome in 100-kb 
windows positively correlate with var. rudis ancestry in agricultural but not 
natural habitats (XPEHH by environment: F = 10.97, P = 9.3 x 10-4; CMH by 
environment: F = 108.51, P < 10°"). 


A. tuberculatus varieties as separate species, with 
few distinguishing characteristics [seed dehis- 
cence and tepal length (16)]. Sauer’s 1955 revision 
of the genus, which used herbarium speci- 
mens to gauge the distribution and migration 
of congeners over the past two centuries 
(46), led him to describe an expansion of the 
southwestern var. rwdis type [at the time, 
A, tamariscinus (Sauer) ] northeastward into the 
territory of var. tuberculatus [A. tuberculatus 
(Sauer)] sometime between 1856 to 1905 and 
1906 to 1955. Our sequencing of >100 herbarium 
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samples dating back to 1828, combined with 
349 contemporary sequences (25, 47), allowed 
us to directly observe the change in the distri- 
bution of these two ancestral types, adding 
further temporal resolution to Sauer’s mor- 
phological observations of the species’ range 
shifts, and to assess the role of agriculturally 
adaptive standing genetic variation across 
varieties. 

Range-wide, we see clear shifts in the dis- 
tribution of var. rudis ancestry based on 
fastSTRUCTURE (48) inference at K Number 
of subpopulations) = 2 (fig. S9) across three 
time spans—1830 to 1920, 1920 to 1980, and 
1980 to 2018 (time span: F = 5.47, P = 0.0045)— 
and particularly so in the east (time span x 
longitude: F = 5.49, P = 0.0045), consistent 
with a recent expansion of var. rudis ancestry 
(Fig. 4A). Furthermore, we see strong state- and 
province-specific shifts in ancestry through time 
in our historical sequences (time span by state 
interaction: F = 4.22, P = 7 x 10°), highlighting 
not only the shift of var. rwdis eastward (with 
increases through time in Ontario, Ohio, Illinois, 
and Missouri) but also the recent introduc- 
tion of var. tuberculatus ancestry into the most 
western part of the range in Kansas (Fig. 4B). 
A. tuberculatus demography thus appears to 
have been markedly influenced by human- 
mediated landscape change over the past two 
centuries, consistent with the massive recent 
expansion of effective population size that we 
had previously inferred from contemporary 
samples over this same time frame (45). That 
this shift has been most notable over the 
past 40 years is further consistent with the 
time scale of agricultural intensification, 
shifts toward conservation tillage, and ram- 
pant herbicide-resistance evolution within the 
species (19, 45, 49, 50), which suggests that 
selection may facilitate the colonization of var. 
rudis ancestry outside its historical range. 
Along these lines, we find that this contem- 
porary range expansion has facilitated the 
sorting of var. rudis ancestry across environ- 
ments (a longitude by time span by environ- 
ment interaction: F = 5.13, P = 4x 10~°; Fig. 
4C), with increasing overrepresentation of var. 
rudis ancestry in agricultural and disturbed 
environments in the eastern portion of the 
range through time, as has been previously 
suggested (25). 

To investigate whether agricultural adapta- 
tion has drawn disproportionately from var. 
rudis ancestry, we reconstructed fine-scale 
ancestry across the genome. On the basis of 
analyses in 100-kb windows, we find a least 
squares mean of 5.5% [95% confidence inter- 
val (CI) = (5.0, 5.9%) ] more var. rudis ancestry 
genome-wide in agricultural environments 
compared with the adjacent natural habitat 
(fig. S10). The environment-specific propor- 
tion of var. rwdis ancestry is not only positively 
correlated with recombination rate (F = 18.85, 
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P=14.x 10°, r = 0.056) and gene density (F = 
8.53, P = 0.004, 7 = 0.050) but also with SNP 
and haplotype-based evidence of environment- 
specific selection. Agricultural but not natural 
populations have an excess of cross-population 
haplotype homozygosity (agricultural versus 
natural XPEHH) and within-pair environmen- 
tal differentiation (CMH P value) in genomic 
regions of high var. rudis ancestry (XPEHH by 
environment: F = 10.97, P = 9.3 x 10-*; CMH by 
environment: F = 108.51, P < 107"; Fig. 4D), 
which implies that ancestry composition 
genome-wide in large part determines the 
extent of polygenic agricultural adaptation. 
These findings suggest that the expansion of 
var. rudis ancestry across the range, partic- 
ularly in the past 40 years, has facilitated 
waterhemp’s success in agricultural habitats 
by providing access to preadapted, standing 
genetic variation. 


Discussion 


Agricultural adaptation in A. twberculatus, a 
native plant in North America, has occurred 
over extremely short time scales, facilitated by 
range shifts in response to the agriculturaliza- 
tion of its native habitat. The human-mediated 
expansion of the southwestern lineage of the 
species northeastward since the latter half of 
the 20th century has introduced new genetic 
variation across the range, on which selection 
in agricultural settings could act. Negligible 
genome-wide differentiation across habitats 
in this species refutes the idea of agricultural 
populations existing as separate to natural eco- 
systems (57). Despite substantial gene flow, 
the prevalence of agricultural alleles has in- 
creased rapidly since the intensification of 
agriculture over the past 60 years—in agricul- 
tural environments by a selection coefficient 
of nearly 6% per year and even in natural hab- 
itats by >2% per year. The selective intensity of 
industrial agriculture is on par with the inten- 
sities that Arabidopsis populations in extreme 
hot and dry environments are predicted to face 
by 2070 under the high-emissions scenario of 
climate change (52). The effects of agricultural 
herbicides are even more extreme—range-wide, 
evolved resistance mutations have experienced 
selective strengths of 20% on average per 
year since herbicides were first introduced— 
permeating even into natural habitats. 
Although modern, industrial agriculture im- 
poses strengths of selection rarely observed in 
the wild, A. tuberculatus has in turn escalated 
the weed management-evolution arms race 
through a multitude of interdependent mech- 
anisms: range expansion, polygenic adaptation 
from standing genetic variation, and large- 
effect herbicide-resistance mutations. Together, 
these results highlight that anthropogenic 
change not only leads to the formation of new 
habitats but also provides an opportunity for 
range expansion that may facilitate and in- 


teract with local adaptation, reshaping genetic 
variation for fitness within native species. 
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Catalytic asymmetric C-H insertion reactions of 


vinyl carbocations 
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From the preparation of pharmaceuticals to enzymatic construction of natural products, carbocations 
are central to molecular synthesis. Although these reactive intermediates are engaged in stereoselective 
processes in nature, exerting enantiocontrol over carbocations with synthetic catalysts remains 
challenging. Many resonance-stabilized tricoordinated carbocations, such as iminium and oxocarbenium 
ions, have been applied in catalytic enantioselective reactions. However, their dicoordinated counterparts 
(aryl and vinyl carbocations) have not, despite their emerging utility in chemical synthesis. We report 
the discovery of a highly enantioselective vinyl carbocation carbon—hydrogen (C-H) insertion 
reaction enabled by imidodiphosphorimidate organocatalysts. Active site confinement featured in 
this catalyst class not only enables effective enantiocontrol but also expands the scope of vinyl 
cation C-H insertion chemistry, which broadens the utility of this transition metal-free C(sp°)-H 


functionalization platform. 


ince the late 19th century, carbocations 
(cationic molecules that feature an even 
number of electrons with a positive charge 
residing on one or more carbon atoms) 
have fascinated organic chemists because 
of their fundamental properties and potent 
modes of reactivity (7). From agostic bonding, 
which is critical to our understanding of chem- 
ical structure (2, 3), to biosynthesis, in which 
carbocations facilitate the assembly of stereo- 
chemically complex natural products (4, 5), 
these reactive intermediates have played an 
essential role in our understanding of chem- 
istry. Despite the relevance of carbocations to 
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many modern synthetic processes, controlling 
the regio-, chemo-, and stereoselectivity of their 
reactions has proven challenging (6). This is 
especially true in asymmetric catalysis, in which 
high-energy, prochiral cationic intermediates 
must proceed through energetically differen- 
tiated diastereomeric transition states (TSs) 
to obtain synthetically useful enantiomeric ex- 
cesses (ee). Achieving an energy difference 
(AAG") of >2.0 kcal/mol (which is required 
for >90% ee at 70°C) is a daunting task for 
such high-energy intermediates for which 
productive bond formation can proceed via 
relatively low energetic barriers (Fig. 1A). More- 
over, the propensity for such intermediates 
to engage in unproductive side reactions fur- 
ther complicates the development of selective 
processes. This fundamental challenge is mani- 
fested in modern carbocation chemistry, in 
which high levels of selectivity are only rou- 
tinely realized in the reactions of resonance- or 
heteroatom-stabilized carbocationic species 


(6-10). The asymmetric reactions of carbocations 
that are not stabilized by 1-resonance remain 
rare (11). 

Unlike resonance-stabilized tricoordinated 
carbocations, dicoordinated carbocations, such 
as aryl and vinyl cations, have thus far been 
excluded from the field of asymmetric catal- 
ysis (Fig. 1B). This is likely due to the lack of 
catalytic methods for their generation and 
their high reactivity once formed (12). Recent 
reports that these intermediates undergo fac- 
ile insertion into unactivated sp? C-H bonds 
(13-20), which often creates stereogenic car- 
bons, prompted us to investigate their appli- 
cation in asymmetric catalysis. In addition to 
the fundamental challenges associated with 
engaging a high-energy species in catalytic 
enantioselective reactions [in some cases, di- 
coordinated cations are computed to undergo 
almost barrierless insertion into alkane C-H 
bonds (15, 16, 20)], most catalytic C-H inser- 
tion reactions of dicoordinated carbocations 
have required the use of weakly coordinat- 
ing anions (WCAs). These anions are notable 
in their weak basicity and nucleophilicity, 
which avoids deleterious E1 elimination or 
Syl nucleophilic substitution reactions (27). 
However, the few chiral WCAs that have been 
reported have not been successfully used in 
asymmetric catalysis (22, 23) because the high- 
ly ordered TSs that are required for enantio- 
selectivity almost always rely on directional 
coordinative interactions. Inspired by the ex- 
quisite stereocontrol over reactive carbocations 
manifested within a confined enzyme active 
site (24) and key synthetic studies on con- 
finement effects in selective catalysis (25, 26), 
we hypothesized that confinement of a dicoor- 
dinated carbocation intermediate could direct 
its ensuing reactivity with enantiocontrol. In 
this work, we report the discovery of a highly 
enantioselective C-H insertion reaction of vinyl 
carbocations enabled by imidodiphosphorimi- 
date (IDPi) strong acids pioneered by List et al. 
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A Reactive intermediates & reaction selectivity 


energy 


a 
= 7 5 
r unproductive productive 


reaction 


High energy intermediate: 


stabilized 


f-S, 4 


+ sp*-hybridized 
* resonance & hyperconjugation 
* many methods for generation 
* known in asymmetric catalysis 


challenging enantiocontrol, facile side reactions 


C This work: enantioselective vinyl cation C-H insertion reactions 


— 
“Yt -@— 


OTs H 
Li 


ist’s IDPi 


xX 
H 
H 


>20 examples 


up to 95% yield, 
>20:1 d.r. & 93% ee 


* enantiocontrol over a dicoordinated vinyl carbocation 


* organocatalytic asymmetric functionalization of unactivated C(sp?)—H bonds 


tricoordinated carbocation 


carbenium 


B Carbocation coordination number and associated properties 


dicoordinated carbocation 


vinyl cation aryl cation 


- high s orbital character 
* minimized resonance & hyperconj. 
* challenging formation 
* unknown in asymmetric catalysis 


Fig. 1. Selective reactions of carbocations. (A) Challenges associated with high-energy intermediates in selective catalysis. (B) Properties associated with 


carbocation coordination number. (C) Discovery of enantioselective C—H insertion reactions of vinyl cations. hig! 


intermediate; OTs, p-toluenesulfonate; X, NSO2CF3 or CH. 


(27, 28), which results in an organocatalytic 
platform for stereoselective functionalization 
of unactivated C(sp®)-H bonds (Fig. 1C). 

Our initial investigations focused on the 
intramolecular desymmetrization of a piper- 
idinyl fragment via C-H insertion to generate 
bicyclic structures, given the prevalence of 
nitrogen-containing polycyclic scaffolds in nu- 
merous bioactive natural products (29). After ex- 
ploration of various catalyst structures (Table 1), 
we were able to catalyze the transformation of 
vinyl tosylate 1a to the cyclized C-H insertion 
product 2a with high ee using IDPi 8B and 
stoichiometric allyltriisopropylsilane (entry 6). 
This C-H functionalization reaction forges 
three contiguous stereocenters with notable 
diastereoselectivity [>20:1 diastereomeric ratio 
(d.r.)] and reasonable conversion after 72 hours. 
We posit that the reaction is sluggish because 
of a high barrier for vinyl carbocation forma- 
tion and therefore is directly dependent on the 
Lewis acidity of the silicon species (12). Thus, 
we hypothesized that the conversion could be 
improved by tuning the silyl group on the allyl 
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silane, given that the silylated IDPi is likely the 
active catalyst (see below). We were inspired 
by Lambert’s studies on the effects of steric 
bulk on silylium ion coordination, in which 
tris(trimethylsilyl)silylium [Si(TMS)3*] paired 
with a WCA generated a near-trivalent silylium 
cation, in contrast to trialkyl silylium cations 
that form tetravalent Si centers by coordination 
to solvent or counteranion (30). Gratifyingly, in 
our system, we observed a positive correlation 
between silane size and activity (entries 6 to 8 
in Table 1), in which tris(triethylsilyl) allylsilane 
resulted in full conversion without any influence 
on enantioselectivity. We performed Gutmann- 
Beckett studies that supported the enhanced 
Lewis acidity of tris(triethylsilyl)silane [Si(TES)s|- 
silylated IDPi relative to its trialkyl silyl IDPi 
counterpart, although other effects cannot be 
ruled out (see supplementary materials). 
With this improved activity and encourag- 
ing enantioselectivity on our model substrate, 
we explored the scope of this reaction (Fig. 2). 
The transformation proved compatible with 
substitution at both of the aryl rings on the 


h E int, high-energy intermediate; low E int, low-energy 


substrate, delivering insertion products in mod- 
erate to good yield with excellent enantioselec- 
tivity (up to 93% ee) and diastereoselectivity 
(>20:1 d.r.). High levels of enantioselectivity 
were maintained with ortho substitution near 
the insertion site (2g and 2i), with electron- 
rich (2f, 2g, and 20), electron-deficient (2e, 
2n, and 2p), and heterocyclic (2j and 2k) 
substrates also performing well. Moreover, 
functional groups labile in many transition 
metal-catalyzed processes (2p to 2s) were 
compatible, which highlights this method’s 
complementarity to transition metal-catalyzed 
C(sp?)-H functionalization platforms. C-H in- 
sertion adjacent to a sterically congested all- 
carbon quaternary center (2m) occurred in good 
yield, albeit with moderate enantioselectivity. 
An additional highlight was the conversion 
of an all-hydrocarbon variant of the model 
substrate to bicyclo[4.3.0]nonenes in modest 
stereoselectivity (2v). The enantiopurity of 
many of the piperidine insertion products 
could be readily upgraded via a single re- 


crystallization to afford enantiopure material 
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(>99% ee). Moreover, the N-triflyl protecting 
group could be removed with sodium bis(2- 
methoxyethoxy)aluminum hydride (Red-Al) 
to afford free amine 2w (31). Heating in- 
sertion product 2a with acid isomerized the 
olefin to the thermodynamically more favored 
tetrasubstituted olefin isomer (2x), without 
loss of enantioenrichment. WCA catalysts that 
were previously used in our laboratory in vinyl 
cation C-H insertion reactions resulted in low 
yields (<10%) or intractable product mixtures 
because of the formation of multiple isomers, 
oligomeric products, and other unidentifiable 
side products (see supplementary materials). 
These findings, when compared with previ- 
ous examples of catalytic vinyl cation C-H 
insertion reactions, highlight a compelling 
advance in substrate compatibility and cat- 
alyst control. 


Confirming the intermediacy of the vinyl cation 


Although catalytic vinyl cation formation via 
sulfonate abstraction has been previously re- 
ported (15, 16), we sought to further support 
its intermediacy in this system. To this end, 
we found that both the Z- and E-vinyl tosylate 
isomers afforded the same insertion product 
with identical enantioselectivities, which sup- 
ports the common intermediacy of a linear 
vinyl carbocation (Fig. 3A). Moreover, we pre- 
pared tosylate 9 (which cannot undergo in- 
tramolecular C-H insertion) and found that 
allene 10 is produced as the major product un- 
der the reaction conditions, which likely arises 
from a Wagner-Meerwein rearrangement— 
a known reactivity pathway of vinyl carboca- 
tions (12). 


OTs IDPi (12 mol%) 
a5 SS allyl silane (1.3 equiv) 
Ph NTf cyclohexane (0.2M) 
65 °C, 72h 
1a 
Entry IDPi X Silane Conv. 
1 3A H allyl TIPS 70% 
2 4B CF3 allyl TIPS 89% 
3 5B CF3 allyl TIPS 95% 
4 6B CF3 allyl TIPS 13% 
5 7B CF; allyl TIPS 93% 
6 8B  CF3 allyl TIPS 81% 
7 8B CF3_~ allyl TMS 38% 
8 8B CF3_ allyl Si(TES)3 full 
9 8B CF3 none 0% 


In the context of enantioselective insertion, 
the allyl silane did not influence the selectivity 
of the reaction, despite its profound influence 
on rate. This result not only supports the pro- 
posed role of the Si-center in the ionization 
of the substrate but also suggests that after 
ionization, a discrete vinyl cation-IDPi ion pair 
is formed that undergoes the enantiodeter- 
mining C-H insertion step. Supporting this 
hypothesis, deuteration of the substrate’s in- 
sertion sites resulted in an appreciable in- 
crease in % ee across two structurally distinct 
catalysts (Fig. 3B), which corresponds to up to 
a 0.12 + 0.03 kcal/mol difference in AAG* (32). 
Moreover, no indication of deuterium incor- 
poration adjacent to the CD, methylene was 
observed, which would be expected if a step- 
wise hydride rebound mechanism was opera- 
tive (15) given that hydride shifts would result 
in H/D scrambling (Fig. 3C). Additionally, 
substrate 1u cleanly forms insertion product 
2u in high yield, with no sign of rearrange- 
ment. Although these observations support 
either a concerted insertion or a rapid rebound 
mechanism, in which a putative 2° carbocation 
is trapped by olefin attack before facile 1,2-shifts 
can occur, we recognize that the mechanism 
likely resides on a continuum between these 
two extreme scenarios (see below). On the basis 
of these results, a proposed mechanism (Fig. 
3D) initiates with protodesilylation of the 
allylsilane to generate the active Lewis acid 
11, which is consistent with prior work by 
List et al. (27, 33). Subsequent ionization of 
the vinyl tosylate by 11 generates the reactive 
vinyl cation species (12), which we propose 
is confined within the IDPi anion as a con- 


TfN 
H 
H 2a 
Tol 
>20:1 d.r. i 
Yield yee 
56% 52% 
79% 85% 
72% 60% 
11% 84% 
84% 85% 
72% 91% 
34% 89% 
91% (81%) 91% 


0% = 


tact ion pair. An enantiodetermining C-H 
insertion step followed by kinetic deproto- 
nation delivers insertion product 2 and re- 
generates IDPi. 


Modeling the TS 


To probe the specific interactions responsi- 
ble for the experimentally observed stereo- 
selectivity, density functional theory (DFT) 
calculations at the wB97X-D/6-311++G(d,p), 
CPCM(cyclohexane)//B3LYP-D3/6-31G(d,p), 
CPCM(cyclohexane) level were carried out. 
Two diastereomeric TSs that used substrate la 
and catalyst 8B were computed and found to 
correspond with a concerted, asynchronous 
C-H insertion of the vinyl cation with an en- 
ergy difference (AAG') of 2.1 kcal/mol, which 
favored the experimentally observed enantio- 
mer (Fig. 3E). In the minor TS, the sterically 
encumbering piperidine is situated in much 
closer proximity to the congested p-SF;, substit- 
uents of the catalyst scaffold. These repulsive 
interactions obstruct the optimal positioning 
of the substrate within the anionic pocket of 
the catalyst, which ultimately weakens the 
intermolecular interactions between the asso- 
ciated ions. The difference in electrostatic sta- 
bilization between the two TSs was calculated 
to be 0.6 kcal/mol (34) with an additional 
1.4 kcal/mol of stabilization in the favored 
enantiomeric TS arising from dispersive inter- 
actions, which together correspond to enhanced 
binding of the C-H insertion TS by the IDPi 
anion if the preferred enantiomeric conforma- 
tion is adopted. Careful inspection of the TS 
structure reveals bending of the vinyl cation, 
placing the tolyl group away from the bulky 


Table 1. Reaction optimization. Yields determined by 'H NMR with an internal standard. *lsolated yield on 0.1 mmol scale. conv., conversion of starting material: TIPS, 


triisopropylsilane; TMS, trimethylsilane. 
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IDPi (8B): 
OTs IDPi (8B) (12 moi%) TN 
NS allyl Si(TES)3 (1.3 equiv) H 
_ 
NTf cyclohexane (0.2M) 
60-75 °C, 72h 
f 2 
Substrate scope: 
TIN ? TIN 


2a, 81% yield 
>20:1 d.r. 
91% ee (>99% ee)" 


TfN TiN 
H H 
> 
(sm 
MeO 


2g", 54% yield 


X-ray structure 
absolute stereochemistry 


2f', 71% yield 


82% ee (>99% ee) 


2c, 81% yield 
>20:1 dr. 
92% ee (>99% ee)" 


2d, 95% yield 
>20:1 a.r. 
88% ee (>99% ee)" 


TiN 
H 
H 
Y | Ph 
Ss 


2fS, 40% yield 


2b*, 70% yield 
>20:1 d.r. 


TfN TfN 


fe) 


2h, 66% yield 2i, 89% yield 


2k, 65% yield 


F 


2e', 57% yield 
>20:1 a.r. 


87% ee (99% ee)" 


TfN 


>20:1 d.r. >20:1 ar. >20:1 a.r. >20:1 d.r. >20:1 dr. >20:1 d.r. 
93% ee 87% ee 82% ee 88% ee 90% ee 88% ee 
TiN TIN TfN TiN TiN 
H TiN a H H i 
ee, "I ye "TY a) 
Ph 7 PMP PMP Tol Tol 
oy” « «) «) C) 
F OMe | BPin 
2I', 70% yield 2m‘, 60% yield 2n', 78% yield 20', 60% yield 2p, 54% yield 2q5, 60% yield 
>20:1 d.r. >20:1 d.r. >20:1 a.r. >20:1 a.r. >20:1 d.r. >20:1 d.r. 
81% ee 92% ee (>99% ee)" 91% ee (>99% ee) 93% ee (>99% ee) 90% ee 90% ee 
HN 
TIN TIN : seal ‘ 
H H a 
uaternar, 
H & H S TIN TNO PhMe,rt so 
Tol Tol H Me H 83% yield Tol sh 
H H H 2w 
PMP 
@) C) Pn oh Ee ke TsOH:H,o ~—'N 
I OH H 
PhMe, 90 °C H 
2r, 66% yield 2s’, 64% yield 26, 40% yield 2utS, 81% yield — 2v’"", 72% yield 66% yield toy 
>20:1 d.r. >20:1 a.r. >20:1 d.r. >20:1 d.r. 3.5:1 ar. (100% es) Ph 
91% ee 87% ee 75% ee 55% ee 57% ee 2x 


Fig. 2. Substrate scope. All reported yields are isolated yields of purified product. *After single recrystallization. {96 hours at 75°C. +0.1M. §70°C. 960°C. #2.3 equivalents 


of silane used because of alcohol silylation; the crude reaction was then stirred with tetrabutylammonium fluoride. **0.025M using 3A. BPin, pinacol boronic ester; 
es, enantiospecificity; Me, methyl; Ph, phenyl; PhMe, toluene; PMP, p-methoxyphenyl; rt, room temperature; Tol, p-tolyl; TSOH, p-toluenesulfonic acid. 
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A Evidence of viny! carbocation intermediacy 
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PH Ph 474% PH a 
9 yield 10 
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B Isotopic labeling studies 
x TfN . 
xX x 
H 
TsO NTf conditions x 
— Se 
PH = PhX x Ph bh 
1b-H/D 2b-H/D 
8B 3A 


=H 82.07 + 0.69 % ee 
=D 84.82 + 0.16 % ee 


A(%ee) 2.75 40.70% ee 
AAAG# 0.12 + 0.03 kcal/mol 


30.22 + 1.13 % ee 
36.01 + 0.16 % ee 


5.78 + 1.14 % ee 
0.09 + 0.01 kcal/mol 


C Potential pathways if rebound mechanism is operative 


Tf 
HN 
NK) D no H/D 
Sohds ae OX D scrambling 
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Ph Ph 
i Tt 
y N 
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Zw SITES) 
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IDPi= @-H catalyte cycle Sf 
O.5 


TIN an 12 
a TfN 
H 
; H enantiodetermining 
deprotonation 7 me C-Hinsanion 
2 2 "@ 
13 


E DFT analysis of insertion TS 


electrostatic potential 
surface map 


Substrate 1a w/ IDPi 8B 


i Major 
AAG = 0.0 kcal/mol 


observed: 91% ee 
predicted: 92% ee 


Minor 


O kcal/mol — +3.9 kcal/mol 


inaccessible 1,2-H shift 


Fig. 3. Mechanistic studies. (A) Evidence for vinyl carbocation intermediacy. (B) Effect of isotopic labeling on enantioselectivity. Listed values are the average of 
triplicate runs. (©) Rearrangement products from a rebound mechanism that were not observed. (D) Proposed mechanism. (E) DFT-calculated diastereomeric 

TS structures with corresponding bond lengths (A), electrostatic potential surfaces, and free-energy differences between major and minor TSs. Substrate atoms 
are rendered as a stick model, and the catalyst is rendered as a space-filling model. Electrostatic potential areas are colored red to indicate a more-negative potential 
and blue to indicate a more-positive potential. (F) Benzylic stereocenter epimerization is not likely. Et, ethyl. 
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—£ 4-fold = 0.77 
8 is 
x 
- 
g 
= 10 
3 
2 
B os 
a 

0.0 x X 


Training Set 
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B Parameterization of truncated catalysts and substrates 


OTs 
WN 
NMRc X 


C PEOE;p» Classification: X = 


15.9 7.95 

N-Tf N-Ts N-TFA 
N-SO,-3,5-CF3Ph 

8.32 OTs eile 

CF, S 


Fig. 4. Statistical modeling. (A) Multivariate linear regression model with a pseudorandom 50:50 
partitioning of the 91 data points into training set:validation set. (B) Visual representation of the molecular 
descriptors used in the model. (C) Substrate classification of X identity on the basis of PEOE;z. 


N-Tf piperidine ring (Tf, trifluoromethyl- 
sulfonyl), which is consistent with an anti relative 
orientation seen in the experimentally obtained 
diastereomer. Although product 2a was com- 
puted to be energetically favored over its ben- 
zylic epimer (14; Fig. 3F), thermodynamically 
driven equilibration after C-H insertion is 
likely not the driver of diastereoselectivity. 1,2- 
hydride shifts are known to be stereospecific 
(35), which rules out benzylic hydride exchange 
as a potential mechanism for epimerization 
of the benzylic stereocenter (Fig. 3F), and hy- 
dride shifts from the ring fusion carbon are 
discounted given the lack of H/D scrambling 
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observed in deuterated substrate Ib-D, (Fig. 
3C). Taken together, these results are consis- 
tent with a 1,1-insertion step that sets all three 
stereocenters concurrently with high stereo- 
control (see supplementary materials for fur- 
ther discussion). 

To further explore factors governing how 
catalyst and substrate structure affected the 
enantioselective outcomes, we investigated 
the reaction holistically by using statistical 
modeling to analyze the diverse reaction out- 
puts recorded within the optimization and 
substrate scope studies. This was accomplished 
by regressing computationally derived molec- 


ular descriptors to the measured enantiose- 
lectivities (36). Substrates and truncated IDPi 
catalysts (see supplementary materials) were 
independently subjected to conformational 
searches by using molecular mechanics, and 
the relevant structures were submitted for 
DFT optimization at the PBEIPBE-D3BJ/def2- 
SVP level of theory with M06-2X/def2-TZVP 
single-point energy corrections. Both steric 
and electronic molecular descriptors were ex- 
tracted from the lowest energy conformer of 
the computed ensembles (37). The data were 
split into a training and validation set by using 
a pseudorandom 50:50 partition followed by 
forward stepwise multivariate linear regres- 
sion analysis. A four-parameter model [coeffi- 
cient of determination (R®) = 0.85] was found in 
which (Fig. 4A) two features for both the catalyst 
and substrate are required for a good correlation 
(Fig. 4B). Cross-validation techniques, includ- 
ing leave one out (LOO; 0.79) and k-fold (k = 4, 
0.77), aS well as external validation [predicted 
R? (preaR”) = 0.77], were performed to indicate 
that a statistically robust model was produced. 
The catalyst is described by the natural bond 
orbital (NBO) partial charge of the para sub- 
stituent and the buried volume within a 3.5-A 
sphere centered on the sulfonamide sulfur 
(VBurg), which indicates that a more steri- 
cally constricted catalyst pocket positively 
affects selectivity, consistent with the DFT 
TS studies. The substrate is described by the 
#3C nuclear magnetic resonance (NMR) chem- 
ical shift and partial equalization of orbital 
electronegativity (PEOE), which are both pri- 
marily electronic descriptors (38). The PEOE 
term can classify general substrate types as 
well as differentiate the various piperidine 
protecting groups that were tested during 
optimization [i.e., p-toluenesulfonyl (Ts), Tf, 
trifluoroacetyl (TFA), and so on] on the basis 
of the proximity of electronegative atoms (Fig. 
4C). Taken together, this model suggests that 
high enantioselectivity results from a com- 
bination of sterically restricted catalysts and 
dense electronegativity present in the Tf- 
protected piperidinyl substrates. Catalysis is 
optimal in the negatively charged pocket of 
the catalyst for substrates with cation stabi- 
lization features, which rationalizes the 93% 
ee we observed for IDPi (8B) with 2f and sup- 
porting the proposed TS. 


Extension to strained-ring product motifs 


Having established the conceptual groundwork 
for achieving enantioselectivity in a vinyl cation 
C(sp”)-H insertion reaction, we sought to test 
our system with new substrate classes, and we 
were particularly drawn to the possibility of 
leveraging the high energy of these intermedi- 
ates to forge strained rings. Bicyclo[3.2.1]octanes 
are prevalent carbocyclic scaffolds found in 
various bioactive natural products (39), and 
we sought to access them in enantioenriched 
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TsO allyl Si(TES)3 (1.3 equiv) 
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65 °C, 72h 
15 
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16a, 48% yield 
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chiral if @ #@ 


16c, 55% yield 


77% ee 73% e€@ 73% ee 
Product elaboration: [ i 

“Tyounovanes| 17a 75% ye 

77% ee (100% es) 
pcc RR 

eae ay el 17b, 66% yield 
74% ee (101% es, 
Ph Ar 45 °C ( ) 


Ar 


Ph 


17c, 67% yield 
72% ee (99% es) 
(>99% ee)" 


17¢ 
X-ray structure Wy 
absolute 
stereochemistry 


Fig. 5. Catalytic asymmetric synthesis of strained bicycles. All reported yields are isolated yields. *After single 
recrystallization. Ar, aryl; DCM, dichloromethane; PCC, pyridinium chlorochromate; tBu, tert-butyl. 


form. To this end, we prepared substrates 
of the type 15 (Fig. 5), for which insertion 
into the cyclohexyl group generates a chiral 
[3.2.1] carbocycle (16), and found that IDPi 
3A (Table 1) catalyzes its formation in mod- 
erate yields and enantioselectivity (up to 
77% ee). Expansion of this chemistry to other 
substrate classes that generate structurally 
distinct products suggests the potential for 
broader applicability of IDPi catalysis in 
carrying out challenging enantioselective 
C(sp*)-H functionalization reactions via vinyl 
carbocations. Moreover, products from this 
reaction can be readily converted to enantio- 
enriched 1,3-diketocyclohexanes (17) via 
oxidative cleavage (40) with high levels of enan- 
tiospecificity. Current catalytic methods to ac- 
cess enantioenriched 1,3-diketocyclohexanes are 
limited to the desymmetrization of anhydrides 
(41), after which additional steps are required 
to convert the resulting carboxylic acid to scaf- 
folds such as 17a to 17¢, which highlights a 
strategic application of this reaction. 
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Competition-level code generation with AlphaCode 


Yujia Li*+, David Choi*}, Junyoung Chung}, Nate Kushmant, Julian Schrittwiesert, Rémi Leblond{, 
Tom Eccles+, James Keeling}, Felix Gimeno}, Agustin Dal Lago;, Thomas Hubert{, Peter Choyt, 
Cyprien de Masson d’Autume?, Igor Babuschkin, Xinyun Chen, Po-Sen Huang, Johannes Welbl, 
Sven Gowal, Alexey Cherepanov, James Molloy, Daniel J. Mankowitz, Esme Sutherland Robson, 
Pushmeet Kohli, Nando de Freitas, Koray Kavukcuoglu, Oriol Vinyals* 


Programming is a powerful and ubiquitous problem-solving tool. Systems that can assist programmers 
or even generate programs themselves could make programming more productive and accessible. 
Recent transformer-based neural network models show impressive code generation abilities yet still 
perform poorly on more complex tasks requiring problem-solving skills, such as competitive 
programming problems. Here, we introduce AlphaCode, a system for code generation that achieved 

an average ranking in the top 54.3% in simulated evaluations on recent programming competitions on 
the Codeforces platform. AlphaCode solves problems by generating millions of diverse programs using 
specially trained transformer-based networks and then filtering and clustering those programs to a 
maximum of just 10 submissions. This result marks the first time an artificial intelligence system has 
performed competitively in programming competitions. 


utomatically creating programs given a 

high-level description of what they should 

do is a long-standing task in computer 

science (J, 2). Creating an artificial intel- 

ligence (AI) system that can solve un- 
foreseen problems by generating code from 
problem descriptions is a challenge that both 
affords a greater understanding of problem 
solving and reasoning (3) and leads to impor- 
tant applications, such as improving program- 
mer productivity (4) and education (5). 

Generating code that solves a specified task 
requires searching in the enormous space of 
all possible character sequences, only a tiny 
portion of which corresponds to valid and cor- 
rect programs. Furthermore, single character 
edits can completely change program behavior 
or even cause crashes, and each task has many 
valid solutions that may be drastically differ- 
ent. These challenges make learning to gener- 
ate correct programs difficult. Most prior work 
has been limited to either restricted domain- 
specific programming languages (6) or short 
code snippets (7, 8). Perhaps the best-known 
examples of program synthesis are Flash Fill 
(6), which synthesizes programs from input 
and output examples to automatically fill in 
data in Microsoft Excel, and code comple- 
tion tools common in integrated development 
environments, which boost programmer pro- 
ductivity (9, 10). 

Contrasting with the conceptually more 
complex systems used in most of program syn- 
thesis history, recent large-scale transformer- 
based (17) language models [which achieve 
impressive performance generating text (72)] 
can, with minimal modification, solve simple 
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programming problems in Python (13, 14). A 
stripped-down version of our model performs 
similarly to these prior works (table S13). How- 
ever, those problems consist mostly of simple 
task descriptions with short solutions, and 
solving them often amounts to translating a 
sequence of steps (e.g., adding together all 
even numbers in a list) directly into code. In 
contrast, generating entire programs often 
relies on understanding the task (e.g., win a 
board game), reasoning out the appropriate 
algorithm to solve it, and then writing the 
code to implement that algorithm. 

Solving competitive programming problems 
(Fig. 1A) represents a big step forward. It re- 
quires understanding complex natural lan- 
guage descriptions, reasoning about previously 
unseen problems instead of simply memorizing 
code snippets, mastering a wide range of algo- 
rithms and data structures, and precisely im- 
plementing submissions that can span hundreds 
of lines. To evaluate these submissions (Fig. 1B), 
they are executed on an exhaustive suite of 
hidden tests and checked for execution speed 
and correctness on edge cases. Feedback is 
minimal; the submission is correct only if it has 
the correct output on all hidden tests, otherwise 
it is incorrect. Hidden tests are not visible to 


the submitter, who must instead write their 
own tests or rely on the trivial example tests for 
debugging. Because competitors are allowed 
to draw on solutions and algorithms from 
previous contests, challenging new problems 
are created for each competition. Competitive 
programming is very popular; events such as the 
International Collegiate Programming Com- 
petition (15) and the International Olympiad 
in Informatics (16) date back to the 1970s and 
are some of the most prestigious competitions 
in computer science, drawing hundreds of 
thousands of participants from around the 
world. Using problems that humans find chal- 
lenging from battle-tested competitions pro- 
vides a robust and meaningful benchmark for 
many aspects of intelligence. 

Early work using program synthesis for com- 
petitive programming has shown that large 
transformer models can achieve low single- 
digit solve rates (13, 17). In contrast, we created 
a code generation system named AlphaCode 
that manages to solve 29.6% of test set held- 
out competitive programming problems in a 
dataset we released named CodeContests, 
using at most 10 submissions per problem 
(comparable to humans). A key driver of 
AlphaCode’s performance came from scaling 
the number of model samples to orders of 
magnitude more than previous work; the 
overall solve rate scaled log-linearly with the 
number of samples generated, even when only 
10 of them were submitted, a sample scaling 
law similar to those found for training com- 
pute and model size (12). 

When evaluated on simulated program- 
ming competitions hosted on the popular 
Codeforces platform (78), AlphaCode achieved 
an average ranking within the top 54.3% of 
human participants (a small, selected subset of 
all programmers). To the best of our knowl- 
edge, a computer system has never before 
achieved such a competitive level in program- 
ming competitions. 


Learning system 


Our system (Fig. 2) was designed to address 
the main challenges of competitive program- 
ming: (i) searching in the huge space of pro- 
grams, (ii) access to only ~13,000 example tasks 


SS ee 
Table 1. AlphaCode’s results on Codeforces competitions. For each contest, we show the 
estimated percent ranking (lower is better) using simulated time and incorrect submission penalties, 
as well as the best and worst possible rankings using minimum and maximum time penalties, 
averaged over three evaluations. Percents are how many users performed better than AlphaCode. 
AlphaCode achieved an overall ranking in the top 54.3% averaged across the 10 contests. 


Contest ID| 1591 1608 1613 1615 


1617 


1618 1619 1620 1622 1623 |Average 


Maximum | 43.5% 43.6% 59.8% 60.5% 65.1% 32.2% 47.1% 54.0% 57.5% 20.6%| 48.4% 
Estimated | 44.3% 46.3% 66.1% 62.4% 73.9% 52.2% 47.3% 63.3% 66.2% 20.9%| 54.3% 


Minimum 


74.5% 95.7% 75.0% 90.4% 82.3% 53.5% 88.1% 75.1% 81.6% 55.3%) 77.2% 
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for training, and (iii) a restricted number of 
submissions per problem. Notably, scaling the 
amount of model samples generated greatly 
improved performance (Fig. 3), and therefore 
many aspects of our system were designed to 
draw samples as quickly as possible, to ensure 
sample diversity, and to select the best samples 
for submission. See supplementary materials 
(SM) text section C for full details of the method. 

Creating solutions to problems can be seen 
as a sequence-to-sequence (19) prediction task: 
Given a problem description X in natural lan- 
guage (e.g., Fig. 1A), produce a corresponding 
solution Y in a programming language (e.g., 
Fig. 1B). This task naturally motivated an 
encoder-decoder transformer architecture (J) 
for AlphaCode, which models p(Y|X). The ar- 
chitecture took as input to the encoder the 
problem description and metadata X as a flat 
sequence of tokenized (20) characters and sam- 
pled Y autoregressively from the decoder one 
token at a time until an end-of-code token was 
produced. The code was then ready to be com- 
piled and run against tests. 

We pretrained our models on a 715-GB snap- 
shot of human code from GitHub (27), with 
cross-entropy next-token prediction loss. Dur- 
ing pretraining, we randomly split code files 
into two parts, using the first part as input to 
the encoder and training the decoder to pro- 
duce the second part. This pretraining learned 
a strong prior for human coding, enabling 
subsequent task-specific fine-tuning with a 
much smaller dataset. 

We fine-tuned and evaluated models on a 
2.6-GB dataset of competitive programming 
problems that we created and released under 
the name CodeContests (22). CodeContests in- 
cludes problems, correct and incorrect human 
submissions, and test cases. The training set 
contains 13,328 problems, with an average of 
922.4 submissions per problem. The valida- 
tion and test sets contain 117 and 165 prob- 
lems, respectively. Full sets of hidden test 
cases used in competitions were not availa- 
ble, so we included extra generated tests in 
CodeContests (created by mutating existing 
tests) to decrease the false positive rate of our 
model’s submissions from 60% (comparable to 
other datasets) to 4%. We still observed that 
42% of solutions were correct but algorithmi- 
cally inefficient, running out of time or mem- 
ory on larger test inputs, which motivated the 
crucial step of evaluating model submissions 
on the official competitive programming web- 
site for the main evaluation. The validation set 
of CodeContests was used as an easy-to-measure, 
low-variance proxy of submitting to a competi- 
tion, and the test set was used only for final 
evaluations after all training and tuning had 
been completed. All data used to train the mod- 
els (in pretraining and fine-tuning) appeared 
online before all problems in the validation and 
test sets, ensuring that only information that 
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You are given two strings s and t, both consisting of lowercase 
English letters. You are going to type the string s character by 
character, from the first character to the last one. 


When typing a character, instead of pressing the button 
corresponding to it, you can press the "Backspace" button. It 
deletes the last character you have typed among those that 
aren't deleted yet (or does nothing if there are no characters in 
the current string). For example, if s is "abcbd" and you press 
Backspace instead of typing the first and the fourth characters, 
you will get the string "bd" (the first press of Backspace deletes 
no character, and the second press deletes the character ‘c’). 
Another example, if s is "abcaa" and you press Backspace instead 
of the last two letters, then the resulting text is "a". 


Your task is to determine whether you can obtain the string t, if 
you type the string s and press “Backspace” instead of typing 
several (maybe zero) characters of t. 


The first line contains a single integer q (1< q < 10°) — the number 
of test cases. 


The first line of each test case contains the string s (1< Is] < 10°). 
Each character of s is a lowercase English letter. 


The second line of each test case contains the string t (1< It] < 
10). Each character of t is a lowercase English letter. 


It is guaranteed that the total number of characters in the strings 
over all test cases does not exceed 2 - 10°. 


For each test case, print "YES" if you can obtain the string t by 
typing the string s and replacing some characters with presses 
of "Backspace" button, or "NO" if you cannot. 


You may print each letter in any case (YES, yes, Yes will all be 
recognized as positive answer, NO, no and nO will all be recognized 
as negative answer). 


Consider the example test from the 


ue statement. 


Output 
In order to obtain "ba" from “ababa’", 
you may press Backspace instead of 
typing the first and the fourth 
characters. 


There's no way to obtain "bb" while 
typing “ababa". 


There's no way to obtain "aaaa" while 
typing "aaa". 


In order to obtain "ababa" while typing 
“aababa’", you have to press Backspace 
instead of typing the first character, 

then type all the remaining characters. 


an 
AlphaCode 


© 


J 


® Solution 
(output ) 


t=int(input()) 


for i in range(t): 
If the letters at the end of 
t=input() both phrases don’t 
a=(] match, the last letter 
b=[] must be deleted. If they 
do match we can move 
onto the second last 
letter and repeat. 


— 


First the solution reads 
the two phrases. 


for Jj ans: 
a.append(j) 

forint 
b.append(j) 

a.reverse() 


b.reverse() 
e=(] 
while len(b)!=@ and len(a)!=0: 
if alo]==b[0): 
c.append(b.pop(Q) ) 
a.pop(0) 
elif a[@]!=b(@) and len(a)!=1: 
a.pop(® 
elif a[@]!=b[@] and len(a)==1: 
a.pop(0) 
if len(b)==0: 
print("YES") 


else: 
print("NO") 


If we've matched 
every letter, it’s 
possible and we 
output that. 


Backspace deletes two 
letters. The letter you 
press backspace 
instead of, and the 
letter before it. 


Fig. 1. Competitive programming problem statement and solution. (A) Problem statement of 1553D 
(“Backspace”: https://codeforces.com/problemset/problem/1553/D), a Codeforces problem (18). This 
is a medium-difficulty problem, with a rating of 1500, and its statement includes a public example test case. 
The entire statement is given to AlphaCode. (B) Solution to the problem statement, generated by Alpha- 
Code. Examples of the exact formatting of problem descriptions, solutions, and what the model perceives 


can be found at https://alphacode.deepmind.com (32). 


humans could have seen before a competition 
would have been available to the models. 

During fine-tuning, we encoded the natural 
language problem statement as a program 
comment, to make it appear more similar to 
files seen during pretraining (which can in- 
clude extended natural language comments), 
and used the same next-token prediction loss. 
We trained a variety of models ranging from 
300 million to 41 billion parameters. 

Our models included the following en- 
hancements on top of a standard transformer 
encoder-decoder architecture. Each improve- 
ment improved the solve rate, as we show in 
the evaluation section: 

1) Multi-query attention: Multi-query atten- 
tion (23) used a full set of query heads but 


shared key and value heads per attention block 
to substantially reduce memory usage and 
cache updates (which were bottlenecks during 
sampling). In combination with an encoder- 
decoder model instead of a decoder-only one, 
the change increased the sampling speed of 
AlphaCode more than 10-fold. 

2) Masked language modeling (MLM): We 
included an MLM loss (24) on the encoder, 
which empirically improved model solve rates. 

3) Tempering: Tempering (25) artificially 
made the training distribution sharper, pro- 
ducing a smoother sampling distribution (a 
regularization effect to prevent overfitting). 

4) Value conditioning and prediction: We 
used value conditioning and prediction to 
discriminate between correct and incorrect 
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OL 


(B) FINE-TUNING \ 
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I 


ily ie T ie 
ans forme Encode the problem BEUIS FOUN 
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CodeContests 
# In this 
— mn ik # problem 
rae Solution Decoder 


def solve( 5 © 


Fig. 2. Overview of AlphaCode. (A) In pretraining, files from GitHub are randomly split into two parts. The first part goes to the encoder as input, and the decoder 
is trained to produce the second part. (B) In fine-tuning, problem descriptions (formatted as comments) are given to the encoder, and the decoder is trained to 
generate the solutions. (C) For evaluation, AlphaCode generates many samples for each problem description, then it executes them to filter out bad samples and 
cluster the remaining ones before finally submitting a small set of candidates. 


problem submissions contained in CodeContests, 
providing an additional training signal and 
allowing the use of incorrect submission data 
that might otherwise mislead the model. 

5) Generation by off-policy learning from 
demonstrations (GOLD): The CodeContests 
training set contains hundreds of solutions 
for each problem. The standard cross-entropy 
next-token prediction loss would put equal 
weight on all solutions. A successful model, 
however, only needs to generate a single cor- 
rect solution for each problem. To resolve this 
discrepancy, we adopted a variation of GOLD 
(26), an offline reinforcement learning (RL) 
algorithm that focuses training on the most 
likely solutions for each problem instead of all 
possible solutions. 

At sampling time, the diversity of samples 
was important, so that the millions of samples 
for each problem could effectively explore the 
space of possible solutions. As in another pub- 
lication (27), we ensured sample diversity by 
using a high temperature and conditioning 
samples on random metadata: problem diffi- 
culty ratings, problem tags (which indicate 
which techniques a solution might use), and 
solution programming language. 
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To select the 10 best samples for submis- 
sion, we applied filtering and clustering to ob- 
tain a small number of candidate submissions 
on the basis of their program behavior. Filtering 
executed samples using example tests included 
in the problem statement and removed samples 
that failed those tests. This filtering removed 
~99% of model samples. The possibly tens of 
thousands of candidate samples that remained 
were then clustered by executing them on inputs 
generated by a separate transformer model 
trained to do test input generation and by 
grouping together programs that produced the 
same outputs on the generated inputs. We then 
picked one sample from each of the 10 largest 
clusters for submission, approximately the most 
likely program behaviors from our model. In- 
tuitively, correct programs would behave the 
same and form large clusters, but incorrect 
programs could fail in many different ways. 


Evaluation 


To assess the performance of AlphaCode, we 
evaluated it against programming compe- 
titions from the Codeforces platform (18). 
Compared with reporting the solve rate on a 
dataset, this evaluation avoids dataset assum- 


ptions and weaknesses that could skew results 
and allows us to benchmark against the best 
performers on this task-human competitors. 

We ensembled our 41 billion (41B) and 9 bil- 
lion (9B) parameter models by pooling their 
samples and then evaluated the ensemble on 
all Codeforces competitions from 1 December 
2021 to 28 December 2021 with >5000 par- 
ticipants per contest, a total of 10 competitions 
that we believe are a representative sample of 
Codeforces contests. For each contest, we sim- 
ulated running AlphaCode live, generating 
samples for each problem, filtering with ex- 
ample tests (28), and clustering to get candidate 
submissions. We submitted these candidates 
to the Codeforces platform and computed 
AlphaCode’s placement in each contest (Table 1). 
After the first run, we repeated this procedure 
two more times to measure variance. 

Overall, our system achieved an average rank- 
ing in the top 54.3% when limited to 10 sub- 
missions per problem, although 66% of solved 
problems were solved with the first submis- 
sion. This performance in competitions approx- 
imately corresponds to a novice programmer 
with a few months to a year of training (see SM 
text section G). To the best of our knowledge, 
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Table 2. Buildup ablation for model enhancements. Effect of each additional model enhancement 
building up from “No enhancements’—a plain fine-tuned 1B encoder-decoder model trained with 
the standard next-token prediction loss. Numbers in parentheses represent 95% confidence intervals. 
For each setting, we fine-tuned and sampled from at least three different models from the same 


pretrained checkpoint. 


Fine-tuning setting 


No enhancements 


77% (72-85) 


+ Clustering 


this is the first time that a computer system has 
been competitive with human participants in 
programming competitions. Training and eval- 
uating our largest 41B model on Codeforces 
required a total of 2149 petaflop/s-days and 
175 megawatt-hours [~16 times the average 
American household’s yearly energy consump- 
tion (29)]. The large resource usage required 
for this experiment both has environmental 
impacts and makes replication challenging for 
most research entities. 

To assess our modeling decisions, we eval- 
uated our models on the validation and test 
sets of CodeContests. The main CodeContests 
metric was 10@k: the percentage of problems 
solved when we take & samples from the model 
for each problem but can only submit 10 of 
them for evaluation on the hidden tests. If any 
of the 10 selected samples solved a problem, 
the problem is solved. This limit of 10 submis- 
sions mimics the upper bound of the number 
of submissions a human would typically use, 
and tracking & is important for comparisons, 
as performance improves as & increases. 

With up to 100,000 samples per problem 
(10@100K), the AlphaCode 41B model solved 
29.6% of problems in the CodeContests test 
set. Prior work on comparable datasets achieved 
low single-digit solve rates (13, 17), and direct 
comparisons can be found in SM text sec- 
tion E.3. 

Figure 3 shows how the model perfor- 
mance scaled on the 10@x metrics with more 
samples, that is, as we increased k, or with 
more compute. These scaling curves highlight 
a few notable facts about this problem domain 
and our models: 

1) Solve rates scale log-linearly with more 
samples. As shown in Fig. 3A, the 10@k solve 
rates scaled approximately log-linearly with k, 
bending down slightly at high sample bud- 
gets. The fact that sampling much more than 
10 still improved the 10@k solve rate shows 
how important it was to sufficiently explore 
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10@1K 
6.7% (6.5-6.8) 


12.2% (10.8-13.4) 


Solve rate 
10@1M 
19.6% (18.2-20.4) 


21.9% (20.7-22.6) 


28.4% (27.5-29.3) 


the search space before committing to the 
final 10 submissions per problem. However, 
improving the solve rate required exponen- 
tially increasing amounts of samples, and the 
costs quickly became prohibitive. 

2) Better models have higher slopes in the 
scaling curve. Figure 3A also shows that larger 
models tended to generate higher-quality sam- 
ples, reflected as better solve rates with the 
same number of samples and higher slopes 
in this log-linear scaling curve. Because of log- 
linear scaling, a better model could reach the 
same solve rates with exponentially fewer sam- 
ples than worse models. Improving model qual- 
ity is an effective way to counter the exponential 
explosion of sample budget required for a 
higher solve rate. 

3) Solve rates scale log-linearly with more 
compute. As shown in Fig. 3B, the solve rate 
also scaled approximately log-linearly with 
more training compute. Each vertical slice on 
the curves corresponds to one model size. 
Figure 3C shows how the solve rate scaled with 
sampling compute: Larger models required 
more compute per sample but eventually out- 
performed smaller models, as the better qual- 
ity of samples from the larger models became 
the dominant factor for performance. Both 
ways of leveraging more compute demon- 
strate log-linear scaling. 

4) Sample selection is critical to solve rate 
scaling. Figure 3D shows how 10@k solve 
rates scaled when we used different sample 
selection methods. With clustering and filter- 
ing, as was used in AlphaCode, taking more 
samples improved the solve rate. However, the 
combination was far from the theoretical up- 
per bound of performance with perfect sample 
selection. Without filtering (bottom curve), 
10 samples were randomly selected for sub- 
mission, and taking more samples did not 
improve the solve rate. 

Table 2 shows a buildup ablation of 
AlphaCode’s enhancements, which signifi- 


cantly improved the solve rate on models at 
the 1B scale. We started from the base setting 
with no enhancements (beyond the multi-query 
attention change). We added one new setting 
at a time, with the final setting that corresponds 
to AlphaCode reported at the bottom of the 
table. Combining the six enhancements to- 
gether increased the 10@1M solve rate from 
19.6% to 28.4%, although the improvement 
depended on the number of samples. 


Model analysis 


To explore the possibility of solving problems 
by exploiting weaknesses in the problem struc- 
ture, we analyzed the capabilities and limi- 
tations of our models. A common concern 
for large language models trained on large 
amounts of data is that they solve downstream 
problems by simply memorizing the training 
set (30, 31). For competitive programming to 
be a good test of problem-solving ability, mod- 
els need to come up with novel solutions to 
solve new problems. We found no evidence 
that our model copied core logic from the 
training data. Copying solutions from the train- 
ing set was not sufficient to solve any prob- 
lems in the unseen validation set, and the 
longest substrings that both human solutions 
and AlphaCode solutions shared with the train- 
ing data were of similar lengths (fig. S10). 
Further manual investigation showed that core 
logic of model solutions necessary to solve prob- 
lems could not be found in the training data. 

We also analyzed the properties of model 
samples. Like humans, models correctly solved 
easier problems more often than harder prob- 
lems; the 41B model solved 62.4% of validation 
problems rated between 800 and 1100 and 7.8% 
of problems rated between 2400 and 2700 
(higher-rated problems are more difficult). 
Models had the highest solve rates for prob- 
lems that deal with bitmasks, sorting, greedy 
algorithms, or math, but the lowest solve rates 
for dynamic programming, constructive algo- 
rithm, and graph problems. The former cat- 
egories tend to afford shorter solutions and 
the latter categories afford longer ones, indi- 
cating that AlphaCode had trouble producing 
longer solutions. 

AlphaCode was sensitive to changes in the 
problem descriptions, indicating that it did 
not exploit obvious weaknesses in the task 
structure. We modified a problem statement 
that asked for a program that can find the 
maximum product of two consecutive array 
elements. When AlphaCode generated sam- 
ples for the opposite problem (minimum in- 
stead of maximum product) and a related 
problem (any elements instead of consecutive 
ones), the percentage correctness of those 
samples on the original problem decreased from 
17.1% to 0.1% and 3.2% for the opposite and 
related rewordings, respectively. AlphaCode 
attended to important changes in problem 
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amount of compute we used for sampling, the optimal model size increased. 

(D) Improved sample selection methods were necessary to see solve rate scaling. In 
order of quality: random selection (“10@k no filtering”), filtering using example 
tests (“10@k with filtering”), clustering after filtering (“1O@k with filtering + 
clustering”), and perfect sample selection. TPU, tensor processing unit. 


descriptions that fundamentally changed the 
problem, instead of uniformly trying every re- 
lated algorithm by abusing the large sample 
budget. Conversely, alterations that funda- 
mentally left the problem unchanged affected 
solve rates less, indicating that AlphaCode 
could mostly ignore changes that humans 
could also ignore. For example, introducing 
30 adjacent character transposition typos de- 
creased the 10@1024 solve rate from 13.5% to 
11.3%. More analysis can be found in SM text 
section F. 


Conclusion 


Here, we present AlphaCode, a system applied 
to code generation for competitive program- 
ming that can generate novel solutions to un- 


seen programming problems. When evaluated 
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on Codeforces, AlphaCode performed roughly 
at the level of the median competitor. We 
found that massively scaling up sampling and 
then filtering and clustering samples to a small 
set, together with efficient transformer archi- 
tectures to support large-scale sampling, were 
essential to achieving good performance. Our 
clean dataset and robust evaluation procedure 
[released as CodeContests (22)] also contrib- 
uted substantially to guiding our research pro- 
gress. We show through detailed analysis that 
there is no evidence that AlphaCode copied 
important parts of previous solutions or ex- 
ploited weaknesses in the problem structure. 
The analysis indicates that our model was in- 
deed able to solve problems it has never seen 
before, even though those problems require 
considerable reasoning. 


Combined with careful research and engi- 
neering, the relatively simple architecture of 
transformers shows exceptional potential in 
performing reasoning necessary to solve com- 
plex problems with code. This line of work has 
exciting applications that can improve the 
productivity of programmers and make pro- 
gramming accessible to a new generation of 
developers. Our work in code generation still 
leaves much room for improvement, and we 
hope that further work will advance the fields 
of both programming and reasoning in AI. 
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CHEMICAL GENETICS 


IFITM proteins assist cellular uptake of diverse 


linked chemotypes 


Kevin Lou’, Douglas R. Wassarman’, Tangpo Yang’, YiTing Paung’, Ziyang Zhang’, 
Thomas A. O'Loughlin®®, Megan K. Moore’, Regina K. Egan’, Patricia Greninger’, Cyril H. Benes”®, 
Markus A. Seeliger®, Jack Taunton’, Luke A. Gilbert®©°°*, Kevan M. Shokat>2* 


The search for cell-permeable drugs has conventionally focused on low-molecular weight (MW), nonpolar, 
rigid chemical structures. However, emerging therapeutic strategies break traditional drug design 
rules by employing flexibly linked chemical entities composed of more than one ligand. Using 
complementary genome-scale chemical-genetic approaches we identified an endogenous chemical 
uptake pathway involving interferon-induced transmembrane proteins (IFITMs) that modulates the cell 
permeability of a prototypical biopic inhibitor of MTOR (RapaLink-1, MW: 1784 g/mol). We devised 
additional linked inhibitors targeting BCR-ABL1 (DasatiLink-1, MW: 1518 g/mol) and EIF4A1 (BisRoc-1, 
MW: 1466 g/mol), uptake of which was facilitated by IFITMs. We also found that IFITMs moderately 
assisted some proteolysis-targeting chimeras and examined the physicochemical requirements for 


involvement of this uptake pathway. 


ny therapeutic molecule that binds to an 
intracellular target must first cross the 
cell membrane. Retrospective analyses 
of compound libraries and their biolog- 
ical activities have yielded empirical 
guidelines (e.g., Lipinski’s rule of five) that 
enrich for lead-like scaffolds with high passive 
permeability and largely define modern drug- 
like chemical space (/-3). Although these prin- 
ciples have been useful for streamlining the 
search for new therapeutics, many important 
intracellular drug targets are currently refrac- 
tory to inhibition by these compact, hydro- 
phobic, and rigid molecules. One emerging 
design framework that seeks to address these 
challenges involves increasing pharmacologi- 


cal complexity by linking multiple ligands into 
a single chemical entity (a linked chemotype). 
Linked chemotypes can have enhanced po- 
tency, greater selectivity, and the capacity to 
induce the association of more than one tar- 
get (4-11). This modular rapid access to high 
molecular weight, amphiphilicity, and rota- 
tional flexibility can provide useful chemical 
probes and therapeutic leads for intracellular 
targets, as long as the resulting molecules re- 
main cell-permeable. 

Mechanisms to understand and predict the 
cell permeability of linked chemotypes, how- 
ever, remain limited. Other medium-to-high 
molecular weight therapeutics such as natu- 
ral products and synthetic macrocycles often 
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constitute highly tailored arrangements of 
polar and nonpolar functionality that allow 
switching between membrane-favored and 
aqueous-favored conformations to enable pas- 
sive permeability through membranes (12). Ad- 
ditionally, cell-penetrating proteins/peptides 
commonly require appendage of highly charged 
moieties to enable electrostatic interactions 
with the plasma membrane and subsequent 
internalization (13-15). Studies involving the 
most rapidly expanding linked chemotype 
class—proteolysis-targeting chimeras (PROTACs) 
(16)—provide varying insights into the deter- 
minants of cell permeability (17-22). Despite 
their atypical properties, PROTACs and addi- 
tional large molecules such as the dimeric 
immunophilin ligand rimiducid have shown 
in-cell activity robust enough to enter clinical 
trials (16, 23). 

Given this discrepancy between the favor- 
able biological activity of many large, bivalent 
molecules and traditional concepts of passive 
permeability, we inferred that linked chemo- 
types might hijack cellular processes to assist 
with passage through the cell membrane. We 
selected as an example a bitopic inhibitor of 
MTOR, RapaLink-1 (7), whose molecular weight 
(1784 g/mol) falls well beyond common guide- 
lines (< 500 g/mol) (2), and even beyond that of 
typical PROTACs (800 to 1200 g/mol) (table S1) 
(18). RapaLink-I’s atypical structure—composed 
of the allosteric MTOR inhibitor rapamycin 
and the active site inhibitor sapanisertib linked 
by an 8-unit polyethylene glycol (PEGS8) tether 
(Fig. 1A)—confers enhanced selectivity for MTOR 
complex 1 over MTOR complex 2 (7, 24, 25). 
The molecule is highly active in vivo, pene- 
trates the blood-brain barrier, and serves as a 
prototype for the clinical candidate RMC-5552 
(7, 24-27), establishing itself as a drug-like 
compound that defies most traditional no- 
tions of drug-like structures. We hypothesized 
that cellular mechanisms assisting RapaLink-1’s 
cytoplasmic entry could be identified by sys- 
tematically perturbing genes that modulate 
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the molecule’s ability to reach and inhibit its 
intracellular target. 


Complementary genome-scale chemical-genetic 
approaches identify IFITMs as regulators of 
RapaLink-1 cellular activity 


We probed canonical protein-coding genes for 
cellular factors that determine RapaLink-1 
uptake and sensitivity using a dCas9-based 
CRISPRi/a functional genomics platform (28, 29). 
Gene expression inhibition and activation, 
through CRISPRi and CRISPRa, respectively, 
act as complementary approaches to map 
chemical-genetic interactions at the genome 
scale. In particular, genes displaying strong 
mirrored phenotypes (resistance upon knock- 
down and sensitivity upon overexpression) are 
likely to be directly involved in a small mol- 
ecule’s mechanism of action. This integrated 
approach to identifying physiologically rele- 
vant chemical-genetic interactions was pro- 
posed by Jost et al. and its utility has recently 
been reviewed (30, 31). In addition to the 
bitopic inhibitor, we also assessed sapaniser- 
tib, rapamycin, and an unlinked control (a 1:1 
mixture of sapanisertib and rapamycin) to 
distinguish chemical-genetic interactions spe- 
cific to the linked chemotype (Fig. 1A). 

Patient-derived chronic myeloid leukemia 
(CML) cells, K562, pre-engineered to express 
CRISPRi or CRISPRa machinery, were trans- 
duced with their respective genome-scale 
sgRNA libraries, selected with puromycin to 
remove nontransduced cells, and treated with 
DMSO, sapanisertib, rapamycin, sapanisertib + 
rapamycin, or RapaLink-1. The experiments 
were conducted with high-replicate reprodu- 
cibility (fig. S1, A to D), and data from the 
genome-scale CRISPRi (data S1 and $2) and 
CRISPRa (data S3 and S4) screens were 
juxtaposed to highlight genes that displayed 
mirrored phenotypes (Fig. 1B). This arrange- 
ment distributes genes which functionally 
synergize with the inhibitor in the lower right 
(e.g., FKBP12, the required inhibitory complex 
partner of rapamycin) and those which antag- 
onize the inhibitor in the upper left (e.g., MTOR, 
the direct target) (30, 31). Chemical-genetic 
interactions with MTOR signaling components, 
particularly the Ragulator complex (RRAGA, 
RRAGC, and LAMTORI-5) and nodes down- 
stream of PI3K/AKT (TSCI, TSC2, and RHEB), 
were observed across multiple inhibitor con- 
ditions (fig. S2, A and B), consistent with known 
pathway relationships (32) and prior functional 
genomics studies (33, 34). 

A distinct set of chemical-genetic interac- 
tions were identified as top hits with RapaLink-1 
and not with any of the nonlinked molecules 
tested, suggesting the involvement of a bio- 
logical pathway that promotes the activity of 
the linked chemotype. The expression of mem- 
bers of a highly homologous gene family— 
interferon-induced transmembrane proteins 


(IFITMs) [FITM1, IFITM2, and IFITM3 (35)— 
synergized with the activity of RapaLink-1 and 
not its nonlinked counterparts, sapanisertib 
and rapamycin (Fig. 1B). To validate this find- 
ing, we tested sgRNAs targeting JFITMI-3 in- 
dividually for transcriptional repression or 
activation (fig. S3A and table S2). CRISPRi- 
mediated knockdown of IFITM1-3 was potent 
and selective (fig. S3B). CRISPRa-mediated 
overexpression was also potent although we 
observed variable cross activation between 
family members (fig. S3C), possibly due to 
concerted transcriptional regulation of these 
genes, which are adjacent to each other on 
chromosome II (fig. S3A) (36). We individually 
confirmed that top screen hits, including FKBP12 
and JFITM1-3, synergized with RapaLink-1 
in a competitive growth assay; we also vali- 
dated that the JFITM1-3 chemical-genetic in- 
teraction was specific to the linked chemotype 
(fig. S83, D and E). 

Seeking to generalize these observations 
beyond a single cell type, we employed an in- 
dependent chemical-genetic approach corre- 
lating MTOR inhibitor sensitivity data with 
basal gene expression in diverse in vitro mod- 
els (37-39). More than 500 cancer cell lines 
were assessed for sensitivity to sapanisertib, 
rapamycin, or RapaLink-1 (measured by area 
under the dose-response curve). These mea- 
surements were correlated with gene transcript 
abundance (measured by RNA sequencing) 
across the cell lines to identify predictive bio- 
markers for compound sensitivity or resis- 
tance. High expression of any of the three 
IFITM family members was strongly associ- 
ated with enhanced RapaLink-1 sensitivity 
across 659 cell lines, and JFITM2 was notably 
the single most associated sensitizing bio- 
marker (negative correlation) for Rapalink-1 
(Fig. 1C and data S5). This correlation was ab- 
sent for sapanisertib and rapamycin (fig. S4, A 
to C, and data S5), recapitulating the CRISPRi/ 
ascreens. Together, our analysis of the CRISPRi/ 
ascreens and large-scale chemogenomic cell 
line profiling experiments suggested a role of 
IFITMs in promoting the activity of RapaLink-1 
across diverse cell types and levels of IFITM 
expression. 


IFITMs promote RapaLink-1 pharmacodynamic 
target engagement 


To unmask potentially overlapping IFITM func- 
tions, we knocked down JFITM1, IFITM2, and 
IFITM3 expression simultaneously by coex- 
pressing three different targeting sgRNAs 
(table S2) (40). Although multigene knock- 
down was potent (fig. S5A), we did not observe 
baseline changes in cell viability (fig. S5B). 
IFITM1-3 triple knockdown ablated MTOR 
inhibition by 3 nM RapaLink-1 in cells, as de- 
termined by intracellular markers of MTOR 
pathway signaling, phospho-S6*”*"’?3>, phospho- 
4EBP1'?”/*6, and phospho-AKT™ (Fig. 1D), 
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Fig. 1. IFITMs promote the cellular activity of a bitopic MTOR inhibitor. 

(A) Chemical structures of MTOR inhibitors. (B) Gene phenotypes from genome- 
scale CRISPRi and CRISPRa screens in K562 cells. Genes involved in MTOR 
complex 1 (MTOR and RPTOR), a requisite rapamycin inhibitory complex partner 
(FKBP12), and clade | IFITMs (IFITMI1, IFITM2, and IFITM3) are highlighted. 

Data represent two biological replicates. (©) Spearman correlation coefficients 
between RapaLink-1 sensitivity, as measured by dose-response data, and 


Relative cell viability (%) 
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transcript abundance, as measured by RNA sequencing (see also fig. S4). Dose- 
response data are expressed as area under the curve (AUC) and RNA sequencing 
data are expressed as transcripts per million (TPM). Genes are highlighted as in 
(B). (D) Immunoblots of K562 CRISPRI cells expressing sgRNAs treated with 
RapaLink-1 (3 nM) for the times indicated. [(E) and (F)] Viability of K562 CRISPRi 
(E) or CRISPRa (F) cells expressing sgRNAs treated with RapaLink-1. Data represent 
means of three biological replicates; error bars denote SD. 


and conferred resistance to the linked mole- 
cule (Fig. IE and fig. S5C). Overall, IFITM ex- 
pression perturbation by CRISPRi and CRISPRa 
caused a combined 29.5-fold modulation in 
cellular potency of the molecule (Fig. 1, E and 
F). As observed previously (7), RapaLink-1 re- 
quires multiple hours to achieve maximal 
pharmacodynamic inhibition (Fig. 1D). This 
contrasts with the typical finding that small 
molecules reach their intracellular targets on a 
timescale of seconds to minutes (47) and may 
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reflect how linked chemotypes exhibit distinct 
permeability characteristics from traditional 
drug-like molecules. 

Neither of the nonlinked MTOR inhibitors 
tested demonstrated chemical-genetic interac- 
tions with JFITM1-3 and thus IFITMs likely 
do not directly modulate MTOR signaling 
but instead cooperate with some aspect of 
RapaLink-1 not shared with the other inhibi- 
tors. Clade I IFITM family members, /FTTM1-3, 
are closely related broad spectrum viral restric- 


tion factors that localize to the plasma and 
endolysosomal membranes (42-45). They are 
thought to perform their antiviral function in 
part by rendering local membrane character- 
istics at the viral-endosomal juncture unfavor- 
able for viral entry (46), although in some cases 
viruses can also hijack IFITMs to facilitate 
entry and infection (47). In addition to their es- 
tablished immunologic function, clade I IFITMs 
are also reported to modulate an oncogenic 
phenotype (48), affect placenta formation (49), 
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Fig. 2. IFITMs promote the cellular uptake of linked chemotypes. (A) Chemical 
structures of fluorescent RapaLink-1 analogs. (B) Measurement of fluorescent 
molecule uptake in K562 CRISPRi cells expressing sgRNAs (sgRNA+). Cells were 
incubated with TAMRA-N3 (10 nM), TAMRA-PEG8-N3 (1 uM), or RapaTAMRA-PEG8 
(1 nM) for 24 hours. Uptake modulation by sgRNAs was quantified by internal 
normalization to nontransduced cells (sgRNA-) present within the mixture (i.e., 
relative cellular uptake). Data representative of three biological replicates. 
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(sgRNA+ cells traced with dotted outlines) 


(C) Changes in uptake of fluorescent molecules by sgRNAs targeting IFITM1-3 as 
in (B) and fig. S6A. Relative cellular uptake <1 indicates decreased uptake and 

>1 indicates increased uptake. Data represent means of three biological replicates. 
(D) Confocal microscopy images of RPE-1 CRISPRi cells expressing indicated 
sgRNAs (blue) and treated for 24 hours with RapaTAMRA-PEG8 (magenta) and 
LysoTracker (green). sgRNA+ cells are traced with dotted outlines (yellow) in left two 
columns for clarity. Scale bar denotes 20 um. 
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and contribute to cellular homeostasis (44). In 
turn, Rapalink-1 might interact with IFITMs 
through a cellular pathway that promotes up- 
take of large molecules. 


A fluorescent RapaLink-1 analog reveals a role 
for IFITMs in linked chemotype uptake 


To explore our uptake hypothesis, we created 
a fluorescent analog of RapaLink-1 to directly 
observe the effect of IFITM expression on ac- 
cumulation of the linked chemotype in live 
cells. This fluorescent molecule, RapaTAMRA- 
PEG8, was designed by replacing the adeno- 
sine triphosphate (ATP)-site binding element 
in RapaLink-1 with tetramethylrhodamine 
(TAMRA), resulting in a fluorescent derivative 
that closely mimics the physicochemical prop- 
erties of the original molecule (Fig. 2A and 
table S1) (27). Analogs representing partial 
components of RapaTAMRA-PEG8, TAMRA-Ns, 
and TAMRA-PEG8-Ns, were additionally eval- 
uated to assess whether the uptake pathway 
extended to generic compact-hydrophobic or 
linked-amphiphilic chemotypes respectively 
(Fig. 2A). We quantified accumulation of these 
molecules by flow cytometry using a quanti- 
tative live cell fluorescence uptake assay in 
which a mixture of transduced (sgRNA*) and 
nontransduced (sgRNA ) cells were equally 
exposed to compounds within the same well 
(Fig. 2B and fig. S6A). Changes in cellular up- 
take resulting from CRISPRi/a expression 
modulation by sgRNAs (fig. S3, A to C) again 
revealed a chemotype-specific IFITM depen- 
dency pattern (Fig. 2, B and C, and fig. S6A). 
Both linked chemotypes—TAMRA-PEG8-N; 
and RapaTAMRA-PEG8—demonstrated de- 
creased uptake upon knockdown of IFITM1-3 
and increased uptake upon overexpression. 
The linker-less chemotype TAMRA-Ns, by con- 
trast, exhibited no such chemical-genetic inter- 
actions. CRISPRi/a-induced uptake differences 
observed for RapaTAMRA-PEG8 correlated 
strongly with resistance and sensitivity pheno- 
types for RapaLink-1 (fig. S6B), suggesting a 
direct association between measured uptake 
and functional target inhibition. The observa- 
tion that uptake of TAMRA-PEG8-N;—a ge- 
neric linked chemotype not specifically bound 
by any cellular protein—was also IFITM-assisted 
suggests that this uptake mechanism might 
also be used by other linked molecules. 
Additionally, we assessed the role of IFITMs 
on the subcellular localization of RapaTAMRA- 
PEGS by confocal microscopy in a human non- 
transformed cell line, RPE-1, pre-engineered to 
express CRISPRi machinery. Again a mixture 
of sgRNA* and sgRNA’ cells were imaged in 
the same well following equal exposure to the 
fluorescent compound. JFITM1-3 triple-knock- 
down significantly reduced the amount of 
RapaTAMRA-PEG8 entering the intracellular 
compartment (Fig. 2D and fig. S6C). A reduc- 
tion in signal was also observed within the 
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endolysosomal compartment (Fig. 2D and fig. 
S6C), suggesting that IFITMs, which localize 
to the plasma membrane as well as to endo- 
lysosomal membranes (43-45), may play a role 
in facilitating RapaTAMRA-PEG8 uptake through 
endocytic vesicles and into the intracellular 
space. Consistent with this, our functional ge- 
nomics screens identified RapaLink-1-specific 
chemical-genetic interactions among endoso- 
mal (ARF6, VPS26A, VPS29, and VPS35) and 
sterol (OSBP, GRAMDIA, INSIGI, and SCAP) 
regulatory genes (fig. S7, A and B). This, in part, 
resembles the role IFITMs play as antiviral 
effectors in which biophysical interactions with 
incoming viral particles (50) and membrane 
sterols (57) may hinder or assist infection of 
target cells (43-45). Considering the large di- 
versity of viruses IFITMs are described to 
interact with, we hypothesized that the up- 
take assistance afforded to RapaLink-1 and 
RapaTAMRA-PEG8 by IFITMs might also 
extend to other linked chemotypes with sim- 
ilar physicochemical properties. 


DasatiLink-1 is an IFITM-assisted 
bitopic inhibitor of BCR-ABL1 with 
enhanced selectivity 


To explore the generalizability of this IFITM- 
promoted cellular uptake mechanism, we de- 
signed, synthesized, and characterized a bitopic 
inhibitor that is, aside from being a linked mo- 
lecule, compositionally unrelated to RapaLink-1. 
This inhibitor targets a different intracellular 
protein, BCR-ABLI, a fusion oncoprotein asso- 
ciated with CML and other leukemias (52). 
BCR-ABL1 harbors two well-defined small- 
molecule binding sites within its kinase do- 
main (Fig. 3A): the ATP pocket (53), which is 
targeted by five clinically approved compounds 
(e.g., dasatinib) (54), and the myristoyl pocket 
(55), which is targeted by the clinical inhibitor 
asciminib (56). These sites can also be bound 
by the two classes of inhibitors simultaneously 
(55-57). The two pockets span a distance sim- 
ilar to that engaged by RapaLink-1 in MTOR 
(7), suggesting that a similar bitopic inhibi- 
tor linkage strategy could apply to BCR-ABL1. 
We devised a bitopic inhibitor of BCR-ABL1, 
DasatiLink-1, based on the linking of dasatinib 
and asciminib by a flexible tether whose length 
(41 heavy atoms) was close to that of RapaLink-1 
(39 heavy atoms) (Fig. 3B). 

We characterized the interaction between 
DasatiLink-1 and its target using in vitro bio- 
chemical assays. Treatment of purified BCR- 
ABL1 kinase domain with dasatinib or asciminib 
resulted in marked (>0.1 ppm) nuclear magnet- 
ic resonance (NMR) chemical shift differences 
in residues involved in binding to the monomer- 
ic inhibitors (fig. S8, A and B), consistent with 
previous reports (55, 56, 58). The NMR spectrum 
observed in the presence of Dasatilink-1 closely 
matched the spectrum observed with a mix- 
ture of the two nonlinked inhibitors (fig. S8, 


A and B), suggesting that the linked inhibitor 
simultaneously binds to both sites and that the 
tether does not prevent binding to either site. 

We hypothesized that DasatiLink-1 might 
require an allosteric foothold to achieve high 
occupancy of the BCR-ABLI1 kinase domain, 
and we tested this hypothesis using a pull- 
down assay for ATP site availability (59). We 
confirmed that the assay recapitulated a bio- 
chemical ICs9 of <1 nM for dasatinib (60), 
which was unaffected by inclusion of 100-fold 
excess of the allosteric inhibitor asciminib (fig. 
S8C). However, addition of excess asciminib 
impaired the ability of DasatiLink-1 to occupy 
the ATP site, likely resulting from a loss of 
avidity following steric occlusion of the allo- 
steric pocket (fig. S8C). This indicates that 
Dasatilink-1 relies on both the orthosteric and 
allosteric sites for binding, suggesting that it 
might also exhibit the enhanced selectivity often 
observed in bitopic inhibitors (67). Together, 
these biochemical data validate DasatiLink-1 
as a bitopic inhibitor with physicochemical 
properties beyond standard drug design limits 
(1-3), and we reasoned that the molecule’s 
linked composition might allow it to be as- 
sisted into the cell by IFITMs. 

Returning to our K562 CRISPRi/a models, 
which are patient-derived BCR-ABL1 mutant 
CML cells, we characterized the effect of IFITM 
expression on the ability of DasatiLink-1 to 
inhibit intracellular BCR-ABL1 signaling. Sim- 
ilar to RapaLink-1 (Fig. 1, E and F), CRISPRi 
and CRISPRa perturbation of IFITM expression 
resulted in a combined 8.9-fold modulation of 
DasatiLink-1 cellular potency (Fig. 3, C and D). 
We also probed the capacity of DasatiLink-1 to 
engage intracellular BCR-ABL1 by measuring 
pharmacodynamic markers of inhibition. Con- 
sistent with an IFITM-assisted uptake mech- 
anism, JFITMI-3 triple knockdown reduced 
the ability of DasatiLink-1 to inhibit phospho- 
BCR-ABL1** and phospho-STAT5S**™, which 
are markers of BCR-ABL1 activity (Fig. 3E). The 
maximal inhibition observed for DasatiLink-1 
in the TriNegCtrl sg conditions at 8 and 24: hours 
was not ever reached in the JFITMI1-3 triple 
knockdown conditions, likely due to lower in- 
tracellular concentrations of compounds re- 
sulting from decreased uptake. The inhibition 
kinetics we observed for the negative control 
treatment, requiring multiple hours for max- 
imal inhibition at a nanomolar concentration 
(Fig. 3E), were also exhibited by RapaLink-1 
(Fig. 1D). 

We anticipated that DasatiLink-1, akin to 
RapaLink-1, was likely to be selective for its 
target as a result of its multivalent binding 
mechanism—only BCR-ABL1 kinase domain 
contains binding sites for both of its linked 
components. We assessed DasatiLink-1’s 
kinome-wide selectivity in live cells using a 
promiscuous kinase occupancy probe, XO44 
(62), with which kinase active site occupancy can 
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Fig. 3. Design and characterization of an IFITM-assisted bitopic BCR-ABL1 
inhibitor. (A) Molecular model of ABL1 kinase domain. The model was 
constructed by aligning two crystal structures: one bound to dasatinib (PDB, 
2GQG) and one bound to asciminib (PDB, 5MO4). (B) Chemical structures 


biological replicates; error bars denote SD. (E) Immunoblots of K562 CRISPRi 
cells expressing sgRNAs treated with DasatiLink-1 (5 nM) for the times indicated 
(F) In-cell kinase occupancy profiling of DasatiLink-1 and an unlinked control 

(a 1:1 mixture of dasatinib and asciminib) at equimolar concentration (100 nM). Data 


of BCR-ABL1 inhibitors. [(C) and (D)] Viability of K562 CRISPRi (C) or CRISPRa (D) 
cells expressing sgRNAs treated with DasatiLink-1. Data represent means of three 


be determined through competitive activity- 
based protein profiling (63). In contrast to 
an unlinked control (a 1:1 mixture of dasatinib 
and asciminib) at equimolar concentration, 
which competed with XO044 for labeling of 
numerous known dasatinib targets (62), pre- 
treatment with DasatiLink-1 resulted in ob- 
servable intracellular occupancy of only ABL1 
(Fig. 3F and data S6). This single kinase spe- 
cificity extended over a 100-fold concentration 
range up to 1 uM, the same range over which the 
unlinked control demonstrated dose-responsive 
occupancy of numerous off targets (Fig. 3G 
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and data S6). These data suggest that target 
selectivity can be conferred by two-site bind- 
ing, analogous to RapaLink-1’s selectivity for 
MTOR complex 1 (7, 24, 25). 


BisRoc-1 analogs reveal linker length 
dependency of IFITM-assisted cellular uptake 


To further examine the breadth of linked 
chemotypes that might be assisted by IFITMs, 
we designed, synthesized, and characterized a 
new linked molecular glue inhibitor based on 
the natural product rocaglamide. Rocaglamide 
clamps the EIF4A1 helicase to 5’ untranslated 


represent means of three biological replicates. (G) As in (F) for kinases occupied 
following 10 nM, 100 nM, and 1 pM inhibitor treatments; error bars denote SD. 


regions (UTRs) of target mRNAs to inhibit the 
translation of downstream sequences (64). The 
crystal structure of the complex of rocaglamide, 
EIF4A1, and polypurine RNA (65) revealed that 
the molecule’s amide points toward free sol- 
vent, near a symmetry mate (fig. S9A). We 
reasoned that dimerization of rocaglamide 
through its amide position could be a chem- 
ically tractable means to simultaneously en- 
gage two proximal EIF4A1-RNA complexes 
within the cell. We designed a molecule, BisRoc-1 
(fig. S9B), that links two rocaglamide mono- 
mers together with a linker length (35 heavy 
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Fig. 4. IFITMs assist the cellular activity of diverse linked chemotypes. 

(A) Heavy atom skeletons of compounds assessed for IFITM assistance (see also 
fig. S10 for chemical structures). Compounds were categorized as nonlinked 
chemotypes (compounds 1 to 8, black), linked chemotypes with short linkers 
(compounds 9 to 14, gray), or linked chemotypes with long linkers (compounds 
15 to 17, green). (B) Chemical-genetic interaction map of inhibitors in (A) with 
IFITM1, IFTM2, and IFITM3. Potency, as measured by dose-response |Cso in a 
cell viability assay (see also Fig. 1F, Fig. 3D, or fig. S9D for example source data), 


bonds 


was normalized to that of non-sgRNA-expressing K562 CRISPRi or CRISPRa 
cells. Physicochemical properties, including molecular weight (MW) and number 
of rotatable bonds, with their respective traditional thresholds for drug-likeness 
are indicated (right). Data represent means of three biological replicates. 

(C) Map of chemical space populated by 304 kinase inhibitors in clinical 
development (black), 3270 PROTACs reported in the literature (gray), and 

3 linked chemotypes described herein (green). Boundaries represent traditional 
guidelines for drug-likeness. 


atoms) exceeding the distance separating two 
rocaglamide binding sites in the crystal struc- 
ture (fig. S9A). Similar to RapaLink-1 and 
DasatiLink-1, CRISPRi and CRISPRa pertur- 
bation of IFITM expression resulted in a com- 
bined 6.2-fold modulation in cellular activity 
of BisRoc-1 (fig. S9, C and D). Additionally, 
we evaluated the relationship between linker 
length and IFITM assistance by examining an 
analog series consisting of BisRoc-1 (PEG11), 
BisRoc-2 (PEG4), BisRoc-3 (PEG2), and roca- 
glamide (no linker) (fig. S9E). We treated our 
K562 CRISPRi and CRISPRa cells with these 
inhibitors and evaluated differences in poten- 
cy resulting from IFITM expression modula- 
tion, as measured by a half-maximal inhibitory 
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concentration (IC;9) shift in a cell viability as- 
say (fig. S9, E and F, and data S7). This re- 
vealed a pattern in which longer linker lengths 
correlated with greater IFITM assistance. Com- 
bined, these data suggest the general feasi- 
bility of retaining cell permeability despite 
increased pharmacophore size, polarity, and 
flexibility in the context of linked chemotypes 
described herein. 


An expanded chemical space for 
cell-permeable molecules 


Given the ubiquitous presence of IFITMs in 
cells, we hypothesized that the cellular uptake 
of other linked inhibitors in the literature 
could also be assisted by IFITMs. Although 


generally not as large as the linked chemo- 
types described above, PROTACs are likewise 
composed of two chemical entities covalently 
attached by a flexible tether (16). Thus, we in- 
cluded four PROTACs (GMB-475, MZ1, BETd- 
260, and dBET6) and their nonlinked parent 
inhibitors in an expanded survey of chemical- 
genetic interactions with IFITMs (Fig. 4A, fig. 
$10, and table S1). These compounds were eval- 
uated in the same IFITM dependency anal- 
ysis as the BisRoc linker series (Fig. 4B and 
data S7). Using RapaLink-1 as a chemical 
benchmark, we observed that IFITM1, IFITM2, 
and JFITM3 overexpression sensitized cells to 
linked chemotypes (Fig. 4B; compounds 9 to 
17). The inverse finding—resistance to linked 
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chemotypes—resulted from gene knockdown 
(Fig. 4B). The trend observed in the BisRoc 
series was corroborated across the nine bivalent 
molecules tested (Fig. 4B): the magnitudes of 
chemical-genetic interactions correlated with 
linker length which is reflected in inhibitor 
size (molecular weight) and flexibility (num- 
ber of rotatable bonds). Linked chemotypes 
with long linkers were more IFITM-assisted 
than linked chemotypes with short linkers, and 
nonlinked chemotypes (Fig. 4B; compounds 
1 to 8) were not observed to be assisted by 
IFITMs (Fig. 4B). Despite their cellular activ- 
ities, the physicochemical properties of the 
linked chemotypes largely violate Lipinski’s 
(1) and Veber’s (2) classic guidelines (Fig. 4, 
A to C, and table SI), raising the need for a 
revised drug design framework that considers 
IFITM-assisted uptake and other cellular im- 
port processes. 


Discussion 


Through a combination of functional genomics 
and chemical methods, we uncovered an endo- 
genous pathway involving IFITMs that, in our 
data, promotes the cellular uptake of diverse 
linked chemotypes. With the clinical advance- 
ment of a dimeric immunophilin ligand (23), 
PROTACSs (16), and a RapaLink-1 derivative 
(26), the notion of “drug-like” is continually 
being revised. As evidence, the chemical space 
(66) populated by an ever-expanding set of 
linked preclinical compounds in the literature 
ventures beyond that occupied by lead inhib- 
itors developed under traditional guidelines 
(Fig. 4C) (1-3). Here, we identify IFITM- 
assisted cellular uptake as one of the mech- 
anisms by which linked inhibitors are able to 
break previously established rules surround- 
ing drug likeness. We anticipate that our 
findings will inform the uptake optimization 
of emerging classes of bivalent molecules 
(PROTACSs, Syn-TEFs, RIBOTACs, PHICS, 
DUBTACs, and others) (4-17) and enable the 
design of cell-permeable therapeutics that 
bridge distal binding sites on solitary targets 
or multitarget complexes. 
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Mars—implications for organic geochemistry 
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The Perseverance rover landed in Jezero crater, Mars, in February 2021. We used the Scanning Habitable 


Environments with Raman and Luminescence for Organics and Chemicals (SHERLOC) instrument to 


perform deep-ultraviolet Raman and fluorescence spectroscopy of three rocks within the crater. We 
identify evidence for two distinct ancient aqueous environments at different times. Reactions with liquid 
water formed carbonates in an olivine-rich igneous rock. A sulfate-perchlorate mixture is present in the 
rocks, which probably formed by later modifications of the rocks by brine. Fluorescence signatures 


consistent with aromatic organic compounds occur 
minerals related to both aqueous environments. 


he Perseverance rover landed in Jezero 
crater, Mars, to investigate the geology of 
the crater, identify habitable environments, 
assess whether life ever existed on Mars, 
and collect samples for potential return 
to Earth (7). Jezero hosted an open-basin lake 
during the Noachian era (~3.7 billion years ago) 
(/, 2), contains geologic units associated with 
the largest carbonate deposit identified on Mars 
(2-4), and contains a well-preserved delta with 
clay and carbonate-bearing sediments, which 
might contain organics (7). Organics have pre- 
viously been de- tected on Mars (5, 6). 
We investigated the spatial and mineralog- 
ical context of organics in Jezero crater using 


throughout these rocks and are preserved in 


the rover’s Scanning Habitable Environments 
with Raman and Luminescence for Organics 
and Chemicals (SHERLOC) instrument, a deep- 
ultraviolet fluorescence and Raman spectrom- 
eter capable of mapping organic and mineral 
composition with a spatial resolution of 100 um 
(7). Complementary elemental chemistry anal- 
yses were performed using the Planetary In- 
strument for X-ray Lithochemistry (PIXL) (8-17) 
and SuperCam instruments (9). 

We identify organics and aqueously formed 
minerals at Jezero crater in three rock targets 
(8) analyzed during the first 208 martian days 
of the mission (Fig. 1) located in two different 
geological units within the floor of Jezero 


crater (9, 12). The Garde target is from the 
altered ultramafic Séitah formation (Fm), or- 
bitally mapped as the Crater Floor Fractured 
1 unit (CF-f1) (Fig. 1) (9, 12). The Guillaumes 
and Bellegarde targets are from the overlying, 
and therefore younger, basaltic Maaz Fm, 
orbitally mapped as the ~2.3-billion- to 2.6- 
billion-year-old (13) Crater Floor Fractured 
Rough unit (CF-fr) (9, 12). The Perseverance 
rover drilled four rock samples from the Séitah 
Fm. Two drilled rock samples were obtained 
from the Bellegarde rock, whereas the Guil- 
laumes drilled rock sample attempt, Roubion, 
failed (12). These six rock samples are planned 
to be returned to Earth. 

All three Raman spectral scans (8) from 
Garde exhibit strong peaks at Raman shifts 
between 1080 and 1090 cm’ (investigated in 
38 separate scan points) that are attributed 
to carbonate [spectrum 1 and regions of in- 
terest (ROIs) 1 to 4 in Fig. 2H] and peaks with 
a peak position range of 820 to 840 cm (n = 
60) that are attributed to olivine (ROIs 1 and 
4 in Fig. 2H) (8, 13, 14). Olivines were found 
to be more Fe-rich than laboratory measured 
olivines, with fosterite numbers [defined as 
Mg/(Mg + Fe*) x 100] of 80 to 90 (13), whereas 
carbonates are likely mixed Fe- and Mg-species 
based on 1080- to 1087-cm peak positions (8) 
and Ca-dominated species are excluded based 
on PIXL data (71). These spectral detections 
were overlaid on Wide-Angle Topographic Sen- 
sor for Operations and eNgineering (WATSON) 
camera images to compare spectral positions 
with textures (8). Olivine and carbonate are 
associated with micrometer- to millimeter-sized 
light-toned tan, reddish-brown, and dark-toned 
subangular grains as well as light-toned inter- 
granular spaces (Fig. 2, B and E). Spectral fea- 
tures of olivines and carbonates often co-occur 
in a single spectrum; however, there are also 
areas where either olivine or carbonate occur 
independently. Spectral observations of a weak, 
broad Raman peak centered at ~1060 cm“? 
(full width at half maximum ~200 cm“) could 
indicate a disordered phase consistent with 
amorphous silicates, which is often difficult 
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to detect given the low band intensity (Fig. 2). 
A peak at 960 + 5 cm’ is likely phosphate, 
although perchlorates cannot be excluded 
(Fig. 2). 

Garde detail scans (8) exhibit strong fluo- 
rescence signatures, centered at ~340 nm, that 
spatially correlate with carbonate, probable 
phosphate, and amorphous silicate spectra lo- 
calized within narrow intergranular spaces 
(Fig. 2, E and F). A less-intense fluorescence 
band centered at 285 nm typically accompa- 
nies the 340-nm fluorescence. Other areas 
exhibit no fluorescence (Fig. 2C). 

Guillaumes features white and reddish-brown 
patches, 1 to 2 mm across, that do not have 
well-formed crystal faces (Fig. 3A and fig. S1). 
These are secondary materials within a basal- 
tic igneous rock (9), which we interpret as void 
fills, that correlate with sulfate and perchlo- 
rate spectra. Spectra with high-intensity 950- 
to 955-cm" peaks and minor 1090- to 1095-cm™ 
and 1150- to 1155-cm' peaks match laboratory 
measurements of anhydrous Na-perchlorate 
(8) (Fig. 3G and fig. $4). Two strong Na- 
perchlorate detections correlate with centers 
of the brightest material within the patches 
(8). Guillaumes spectra commonly contain a 
single low-intensity peak positioned at 950 to 
955 cm‘. We interpret these as low-intensity 
Na-perchlorate peaks, although the cation 
species is uncertain because of a lack of re- 
solvable minor peaks (8) (Fig. 3G). Other 
spectra exhibit both 950- to 955-cm™' peaks 
and equally strong 1010- to 1020-cm * peaks, 
with low-intensity broad features at 1120 + 
5 cm‘, and occasional broad 3450 + 5 cm™’ 
hydration features, consistent with a mixture 
of sulfate and perchlorate that is minimally 
hydrated (Fig. 3G). A Ca-sulfate species best 
explains these spectra when coupled with ele- 
mental chemistry data from PIXL and SuperCam 
(8). Two detected 965 cm™ peaks are likely 
phosphates, although perchlorates cannot be 
excluded. 

Bellegarde contains white 0.5- to 1-mm sec- 
ondary crystals with well-formed and sharp 
crystal faces and reddish-brown semi-isopachous 
rims (8) (fig. $2). We interpret these as void 
fills within a basaltic igneous rock. These crys- 
tals exhibit 1010- to 1020-cm * peaks, similarly 
attributed to Ca-sulfate when coupled with 
elemental chemistry data (8). Several of the 
sulfate peaks are also associated with a nar- 
row low-intensity hydration feature at 3560 + 
5 cm’, consistent with hydrated Ca-sulfates 
(fig. $3). The Bellegarde target contains a 
single 1080 + 5 cm’ peak of possible Ca- 
carbonate (Fig. 4). Narrow peaks at 975 cm 
could not definitively be identified and could 
be phosphate or perchlorate (8). The SHERLOC 
mineral detections within the Bellegarde and 
Guillaumes targets are consistent with the 
results from other Perseverance instruments 
(8, 9, 11) (figs. S5 and S6). 
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Fig. 1. Rover images of the three abraded targets and their orbital context. (A) Map of orbitally 
defined geological units within Jezero crater (12), including the Crater Floor Fractured Rough unit (CF-fr) 
equivalent to the Maaz Fm and the Crater Floor Fractured 1 (CF-fl) unit equivalent to the Séftah Fm. The 
star shows the landing site, and white circles show the positions of the three abraded targets. The black 
outlines indicate the locations shown in (B) and (C). [Adapted by permission from Springer Nature Customer 
Service Center GmbH, Springer Nature Space Sci. Rev. (12), copyright (2020)] (B) Orbital infrared 
spectroscopy map showing the location of pyroxene- or olivine-bearing materials in the study area from (4). 
The numbered white circles correspond to the images shown in (G). (C) High Resolution Imaging Science 
Experiment (HiRISE) view of the study area (30). The numbered white circles correspond to the images 
shown in (G). (D) Mastcam-Z image showing the Garde patch on the Bastide outcrop. (E) Hazard avoidance 
camera (Hazcam) image showing the Bellegarde patch on the Rochette rock. (F) Navigational camera 
(Navcam) image showing the Guillaumes patch on the Roubion outcrop. (G) WATSON images of abraded 
targets analyzed in this study. Grayscale images are available in figs. S9 to S11. 


Guillaumes and Bellegarde targets com- 
monly exhibit a weak, broad fluorescence fea- 
ture with a maximum at ~340 nm (Figs. 3D 
and 4D) that appears to be widely distrib- 
uted across each surface and is occasionally 
correlated with reddish-brown materials. Al- 
though this feature sometimes co-occurs with 


perchlorate, sulfate, and possibly phosphate, it 
occurs equally often in areas with unidentified 
mineralogy. Bellegarde has two other signa- 
tures at ~275 and ~305 nm (Fig. 4D), which are 
strong and localized on specific, light-toned 
features. The ~305-nm signature is associated 
with detected sulfate (Fig. 4, D to F, and fig. S2). 
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Fig. 2. SHERLOC Raman and fluorescence results for the Garde abraded 
patch. (A) WATSON image. The yellow outline indicates the region shown in (B). 
(B) Context image and scan outlines. Grayscale versions of the context image 
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with data superimposed are shown in (C), (D), (F), and (G). (C) Fluorescence 
map showing the intensity of three main features centralized at 340, 305, and 
285 nm in red, green, and blue, respectively. (D) Raman mineral maps showing 
the location of detected olivine, carbonate, phosphate, and weak amorphous 
silicate features. Purple and green outlines indicate ROIs 1 and 2, numbered in 
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In Guillaumes, a second fluorescence signature 
at ~275 nm (Fig. 3D) is observed in two locations, 
which are ~300 um in diameter, and are coinci- 
dent with previous SuperCam laser spots (8). 
Observation of olivine and carbonate mix- 
tures within the Garde target of the Séitah Fm 
is consistent with orbital infrared observations 
(2-4) and substantiated by multiple lines of 
evidence (9-11). Previously proposed hypothe- 
ses for the precipitation of these carbonates 
include low- and high-temperature aqueous 
alteration of olivine-rich igneous materials, 
which we will subsequently refer to as ultra- 
mafic protolith (3, 15-17), or precipitation from 
lake or groundwater (4, 75-17). Our 10- to 
100-um-scale textural and spectroscopic evi- 
dence supports carbonate formation through 
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aqueous alteration of an ultramafic protolith, 
known as carbonation. The supporting evi- 
dence includes the following: (i) Carbonate 
cation compositions are consistent with those 
of olivines, suggesting that mixed Fe- and 
Mg-olivine gave rise to mixed Fe- and Mg- 
carbonates, similar to carbonation environ- 
ments on Earth and within martian meteorites 
(16, 18, 19). (ii) The observed carbonates co- 
occur with hydrated materials (9) and poten- 
tially aqueously formed amorphous silicates 
and phosphate. (iii) The spectral and textural 
variation of olivine- and carbonate-dominated 
zones within both primary grains and inter- 
granular spaces are expected for carbonated 
ultramafic protoliths on Earth (16) and within 
martian meteorites (78, 19). 


These observations suggest that the degree 
of aqueous alteration to the ultramafic pro- 
tolith was not pervasive, because large olivine- 
rich domains remain intact. By contrast, the 
alteration of the primary lithology is pervasive 
and occurs throughout (not in specific spatial 
domains, for example, fractures). In ultramafic 
alteration environments on Earth (16) and in 
martian meteorites (18, 19), carbonation can 
be associated with the formation of oxides, 
hydroxides, and/or Fe- and Mg-rich phyllosi- 
licates, which have not been observed (9). 
Carbonation can occur under a wide range of 
temperatures from low to ambient to hydro- 
thermal and metamorphic (15-17). Other alter- 
ation minerals, such as serpentine, have not 
been definitively observed in the Séitah Fm to 
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Fig. 3. SHERLOC Raman and fluorescence results for the Guillaumes 
abraded patch. (A) WATSON image. Two SHERLOC scans (yellow outlines) 
and one PIXL scan (black outline) are shown. The yellow outline on the right 
indicates the region shown in (B). (B) Context image and scan outlines. 
Grayscale versions of the context image with data superimposed are shown 

in (D) to (F). SNR, signal-to-noise ratio. (©) Average Raman spectrum compared 
with laboratory measurements of amorphous silicate, Na-perchlorate, and 
anhydrite. Laboratory spectral features at 1500 to 1600 cm are Oz (vertical, 
dotted line) and organic contaminants. (D) Fluorescence map showing the 


date, which could suggest time-limited interac- 
tions, low water-to-rock ratios, or ambient fluid 
temperatures during carbonation (3, 15-17). 
The similarity between the mineralogy of 
the Garde target in the Séitah Fm to the 
surrounding widespread, regional olivine- 
carbonate-bearing unit with a similar orbital 
spectroscopic signature and geomorphological 
texture (3, 4, 15, 16) suggests that carbonation 
of olivine may have occurred throughout this 
extensive region on ancient Mars (~2.7 billion 
to 3.8 billion years ago). These observations 
parallel those made by the Spirit rover in Gusev 
crater (20) and within (1.3 billion to 4 billion 
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years ago) martian meteorites (18, 19). Model- 
ing has suggested that carbonate deposition 
could have played a role in the evolution of 
Mars’ atmosphere (3, 17, 21), but the geolog- 
ical nature of such a depositional mechanism 
had remained unexplained. Taken together, 
micron-scale SHERLOC observations of this 
phenomenon complement previous orbital and 
meteorite observations and demonstrate the 
alteration of igneous materials that results in 
geological deposition of carbonates. 

Jezero crater perchlorate detections, similar 
to those found by the Phoenix lander (22), have 
been substantiated using three independent 
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intensity of three main features centralized at 340, 305, and 275 nm in red, 
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maps showing the locations of detected perchlorate, Ca-sulfate with and without 
hydration, and 965 cm™ peaks. The numbers correspond to the spectra shown 


from SHERLOC (1 to 4) [positions indicated with 


numbers in (E) and (F)] compared with laboratory measurements. The vertical 
dashed lines indicate the peak positions of matching laboratory spectra. 


able in figs. S16 to S19. 


instruments (9). Previous evidence for mar- 
tian perchlorates also includes observations by 
the Curiosity rover (23), proposed but later 
disputed orbital detections (24), and detection 
within the Tissint meteorite (25). The SHERLOC 
perchlorate detections differ from previous mis- 
sion observations because they are observed 
within the interior of a rock and not on the 
surface, are related to aqueous processes, and 
are likely Na-perchlorate [not previously de- 
tected Ca-, Fe-, or Mg-perchlorates (23)]. 
Previous hypotheses for perchlorate forma- 
tion on Mars are (i) irradiation of chlorine- 


bearing parent minerals (26), (ii) atmospheric 
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Fig. 4. SHERLOC Raman and fluorescence results for the Bellegarde 
abraded patch. (A) WATSON image. The yellow outline indicates the region 
shown in (B). (B) Context image and scan outlines. Grayscale versions of 

the context image with data superimposed are shown in (D) to (F). (€) Average 
Raman spectrum compared with laboratory measurements of amorphous 
silicate and obsidian. Laboratory spectral features at 1500 to 1600 cm™ are 
Oz» (vertical, dotted line) and trace organic contaminants. (D) Fluorescence 
map showing the intensity of three main features centralized at 340, 305, and 


oxidation of chlorine species (27), or (iii) for- 
mation from brines (25). Perchlorates could 
also be mobilized in thin films of fluid (23). The 
Jezero perchlorates occur in white void fills 
within rock interiors and did not form direct- 
ly on the surface as expected from materials 
formed by cosmic irradiation or atmospheric 
oxidation, indicating either formation or mo- 
bilization through briny fluids after basalt for- 
mation. The concomitant detection of sulfates 
and perchlorates within the Guillaumes target 
suggests that sulfate formed together with 
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cies, such as halite, within percolating briny 
waters that were then subsequently oxidized 
to perchlorate. Bellegarde exhibited sulfate 
without perchlorate, suggesting that these 
brines did not precipitate chlorine-bearing 
parent species, that perchlorate formation was 
not pervasive, or that perchlorates have since 
been dissolved. Perchlorates are easily dissolved, 
and therefore perchlorates likely formed when 
these rocks were last exposed to liquid water. 
Perchlorate and sulfate detections within the 
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Grayscale images are available in figs. S20 to S23. 


stratigraphically younger Méaz Fm (9) substan- 
tiates an aqueous environment on Mars that 
occurred separately from the stratigraphically 
older (9) Séitah Fm carbonation environment. 

Deep-ultraviolet fluorescence is particularly 
sensitive to aromatic organic compounds, and 
the fluorescence signatures observed in all 
three targets are consistent with emission from 
aromatic organic compounds that contain one 
or two fused aromatic rings and/or aromatic 
heterocycles (7, 8, 28) (fig. S7). Although fluo- 
rescence signatures cannot be assigned to 


9 DECEMBER 2022 * VOL 378 ISSUE 6624 1109 


RESEARCH | REPORTS 


specific organic compounds, the ~340-nm 
fluorescence is consistent with a base struc- 
ture of two-ring aromatic organics like naph- 
thalene, whereas ~275- to 285-nm fluorescence 
is more consistent with one-ring aromatic or- 
ganics such as benzene (8, 28). The ~305-nm 
fluorescence could indicate either one- or two- 
ring aromatics, depending on functional groups. 
We interpret ~305- and ~275-nm fluorescence 
as organics that occurred with sulfates within 
the Bellegarde target (Fig. 4, D to F), whereas 
we interpret ~285-nm fluorescence as organ- 
ics that occurred with carbonate-phosphate- 
amorphous silicate brown microcrystalline 
alteration zones within the Garde target (Fig. 2) 
(8). The ~340-nm fluorescence co-occurs with 
carbonate-phosphate-amorphous silicate altera- 
tion zones in Garde but is not associated with 
particular phases in Guillaumes and Bellegarde 
(8). This phosphate-correlated 340-nm fluores- 
cence could be explained by organics in the 
same alteration zone and/or a minor compo- 
nent of fluorescent cerium present within the 
phosphate (8). In Guillaumes and Bellegarde, 
340-nm fluorescence is predominantly un- 
coupled from phosphate detections, suggest- 
ing that part of this signal is best assigned to 
organics (8). However, that some or all of the 
~340-nm fluorescence signal is from cerium 
cannot be excluded (8). 

When combined with Curiosity observations 
of organic material in mudstones (29), the 
presence of organic material in igneous rocks 
would imply a diverse relationship between 
geological processes and organic compounds 
on Mars. Several explanations for the presence 
of martian organics are possible, for example, 
infall of meteoritic material (6), in situ synthesis 
mechanisms (18, 19, 25), or a putative relic 
martian biosphere. The association between 
organics and sulfate-, phosphate-, perchlorate-, 
carbonate-, and amorphous silicate-bearing 
mineralogy, as well as alteration textures, 
suggests that aqueous alteration of igneous 
rocks could have played a role in the preser- 
vation (or synthesis) of these organics, as 
seen for similar organics-mineral correla- 
tions in martian meteorites (18, 19, 25). How- 
ever, potentially organic, widespread ~340-nm 
fluorescence could suggest that other pro- 
cesses, such as dust, contributed to detections. 
Some mineral phase associations might not 
be apparent because of instrumental limi- 
tations (8). 

We did not detect Raman peaks consistent 
with aromatic organic compounds, such as 
the C=C stretching mode (or G band) around 
~1600 cm‘. However, Raman scattered light 
is several orders of magnitude weaker than 
fluorescence (8, 28). Organic concentrations 
were possibly insufficient to produce detectable 
Raman scattering, either because of low orig- 
inal concentration or subsequent degradation. 
We estimate a range from 5 x 10" to3 x 10° g 
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of aromatic organics in localized scan points 
that encompass an estimated 6 x 107” g of rock 
(8). Estimates from the average fluorescence 
maps suggest a bulk concentration of 0.1 to 10 
parts per million (ppm), with higher concen- 
trations associated with more aqueously al- 
tered surfaces (8). This is consistent with 
known bulk concentrations of organics that 
contain one- and two-ring aromatic species and 
are indigenous to martian meteorites [11.2 + 
6.9 ppm (J8)] and Curiosity rover detections 
in mudstones [~70 parts per billion by weight 
to 10.6 + 8.9 ppm (6, 29)]. 

Collectively, these data show that the drilled 
samples collected by Perseverance from the 
floor of Jezero crater are likely to contain evi- 
dence for carbonation and formation of sul- 
fates and perchlorates. Fluorescence signatures 
consistent with organics that are present with- 
in these materials indicate an interplay be- 
tween igneous rocks, aqueous alteration, and 
organic material on Mars. 
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ANTIBIOTIC RESISTANCE 


Tuberculosis treatment failure associated with 
evolution of antibiotic resilience 


Qingyun Liu}, Junhao Zhu}, Charles L. Dulberger’?+, Sydney Stanley’, Sean Wilson7§, 
Eun Seon Chung*“, Xin Wang’, Peter Culviner’, Yue J. Liu’, Nathan D. Hicks’, 
Gregory H. Babunovic?, Samantha R. Giffen’, Bree B. Aldridge**, Ethan C. Garner’, Eric J. Rubin’, 


Michael C. Chao’, Sarah M. Fortune'>* 


The widespread use of antibiotics has placed bacterial pathogens under intense pressure to evolve 
new survival mechanisms. Genomic analysis of 51,229 Mycobacterium tuberculosis (Mtb) clinical isolates 
has identified an essential transcriptional regulator, Rv1830, herein called resR for resilience regulator, 
as a frequent target of positive (adaptive) selection. resR mutants do not show canonical drug resistance or 
drug tolerance but instead shorten the post-antibiotic effect, meaning that they enable Mtb to resume 
growth after drug exposure substantially faster than wild-type strains. We refer to this phenotype as 
antibiotic resilience. ResR acts in a regulatory cascade with other transcription factors controlling cell 
growth and division, which are also under positive selection in clinical isolates of Mtb. Mutations of these 
genes are associated with treatment failure and the acquisition of canonical drug resistance. 


ntibiotics are designed to kill bacteria by 

inhibiting biological processes essential 

for their survival. Completely eliminat- 

ing a population of bacteria with anti- 

biotics is challenging even when the 
bacteria are nominally sensitive to the drug (1). 
There is a growing body of work defining the 
mechanisms beyond canonical resistance by 
which bacterial populations avoid clearance 
by antibiotics (2, 3). These include the forma- 
tion of privileged subpopulations of drug- 
tolerant bacteria, as well as the acquisition 
of other traits that enhance bacterial survival, 
including variations in the cellular require- 
ments for the target and alterations in drug 
penetration or efflux (2-4). We hypothesized 
that bacterial adaptation to antibiotics would 
be recorded as mutations in the genes asso- 
ciated with the most clinically relevant of these 
adaptive processes, and determining the tar- 
gets of natural selection would reveal new 
mechanisms that enable bacteria to survive 
antibiotics in patients. Therefore, we sought 
to identify the evolutionary signatures of 
adaptation to antibiotics in Mycobacterium 
tuberculosis (Mtb). This obligate human path- 
ogen has remained one of the largest causes 
of mortality despite facing months of anti- 
biotic pressure in every treated patient and 
having been under antibiotic selection glob- 
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ally since the introduction of streptomycin 
in 1944 (5). 


Ongoing positive selection in the 
Mtb population 


When clinical Mb isolates are sequenced, the 
specimen is typically not derived from a single 
colony, but rather represents the bacterial 
population sampled from the patient. This 
population contains mutations arising from 
within-host evolution of the bacterium, which 
can be tracked by identifying unfixed muta- 
tions detected in deep sequencing (Fig. 1A) 
(6, 7). We reasoned that by looking at within- 
host evolution across very large numbers of 
clinical isolates, even in the absence of clinical 
metadata, we could identify genes that re- 
peatedly acquire mutations (parallel evolu- 
tion) within different patients that would mark 
processes under positive selection. Therefore, 
we assembled previously published whole- 
genome sequencing data from 51,229 Mtb 
clinical isolates (Fig. 1B) and used a mutation 
burden test and the ratio of nonsynonymous 
to synonymous mutations (dN/dS) to identify 
genes that are under positive selection in the 
Mtb population (see the materials and meth- 
ods). We also identified intergenic regions (IGRs) 
that are highly enriched for mutations. As ex- 
pected, genes and IGRs previously associated 
with drug resistance or tolerance were highly 
mutated and the genes had dN/dS ratios >1, 
which is indicative of positive selection (Fig. 
1B and fig. S1). We also identified a second set 
of genes that showed similar selective signals 
but for which the selective pressures are un- 
known (Fig. 1B). 

Functional enrichment analysis indicated 
that the genes under selection are highly en- 
riched for transcriptional regulators (12/60, 
P = 1.57 x 10°; Fig. 1, B and C). One of the 
most frequently mutated genes is resR (gene 


identifier: Rv1830), an essential gene that is 
predicted to be a merR-type regulator. We also 
found evidence of positive selection in an anal- 
ysis of the fixed mutations in resR (dN/dS = 
2.91). The pattern of mutation was very similar 
between unfixed and fixed mutations (fig. S2), 
suggesting that within-host selection leads 
to mutation fixation. The dN/dS of resR was 
higher in drug-resistant strains compared 
with drug-sensitive strains (Fig. 1D), suggest- 
ing that vesR mutations are associated with 
the evolution of drug resistance and consistent 
with a recent study showing that 7esR muta- 
tions were associated with a multidrug-resistant 
tuberculosis (TB) phenotype (8). Protein struc- 
tural modeling indicated that most resR clinical 
mutations occur in the merk-type helix-turn- 
helix DNA-binding domain, with the most com- 
mon mutations located at the interface between 
ResR and duplex DNA (Fig. 1E and fig. $2), 
where they could affect DNA-binding affinity 
and the transcription of downstream genes. 


resR mutants display antibiotic resilience 


To investigate the functional consequence of 
resR mutations, we used oligo-mediated re- 
combineering to make point mutant strains by 
individually introducing three common clin- 
ical mutations (P59L, A85V, and R95C) into 
the chromosomal copy of resR in Mtb (H37Rv) 
(Fig. 1F). These vesR mutations did not affect 
bacterial growth under standard or host- 
relevant carbon source conditions (fig. $3, A 
and B). Given the association with drug resis- 
tance, which occurs through de novo chromo- 
somal mutation in Mtb, we also assessed the 
effect of resR mutations on the bacterial muta- 
tion rate, but did not find alterations (fig. S3C 
and table S1). However, we noted that resR 
mutants were ~20% longer and ~5% wider 
than the wild-type cells (fig. S4) and had a 
thickened cell envelope (fig. S5), suggesting 
that mutations of vesR have functional con- 
sequences on bacterial size control under stan- 
dard growth conditions. 

We next tested the effects of resR mutations 
on drug resistance by measuring the mini- 
mum inhibitory concentration (MIC) for a panel 
of eight antibiotics, including first-line, second- 
line, and new anti-TB drugs (Fig. 2A). resR 
mutants and wild-type strains had very similar 
MIC profiles, with only subtle MIC shifts iden- 
tified: a small increase for isoniazid (0.01 to 
0.02 ug/ml) and minor decreases for rifam- 
picin and ofloxacin (Fig. 2A and fig. S6). These 
changes are far below the MIC designated 
for being classified as drug resistant (Fig. 2A) 
(9). We next assessed the effects of resR muta- 
tions on drug tolerance by performing time-kill 
analyses for the eight antibiotics using drug 
concentrations 100-fold MIC. The resR mutants 
and wild-type strains showed very similar time- 
kill dynamics, although again, there were 
subtle differences for some drugs (ofloxacin, 
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ethionamide, and bedaquiline) at particular 
time points (Fig. 2B). The changes in the mini- 
mum duration of time to kill 99% of bacteria 
(MDKog) were <24 hours (Fig. 2B); previously 
characterized drug-tolerance mutants in Mtb 
prolonged the MDKgg by more than 4 days (10). 

In contrast to these modest changes in MIC 
and MDKop, in the time-kill assays, we ob- 
served a strong and unexpected drug pheno- 
type that clearly distinguished resR mutants 
from the wild-type strains. Once the cells were 
plated on drug-free media after antibiotic ex- 
posure, vesR mutants formed visible colonies 
significantly earlier than the wild type (Fig. 
2C). This early recovery phenotype of resR 
mutants was observed for all eight antibiotics 
tested, whereas no difference in growth was 
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seen in the absence of drug exposure (Fig. 2C 
and fig. S7). Using quantitative image anal- 
ysis to track the growth of individual colonies 
over time (Fig. 3A and fig. S8) (1D, we found 
that antibiotic treatment caused a 4.7- to 
6.8-day delay in colony formation for wild- 
type Mib; these delays were highly reproduc- 
ible and characteristic for each drug (Fig. 3B). 
The vesR mutants reduced the time to colony 
formation by 20 to 50% for all eight drugs 
tested (Fig. 3, B and C). The resR mutants also 
showed faster recovery when challenged by 
combinations of the first-line anti-TB drugs 
(fig. S9, A to C). We calculated the growth 
dynamics of traced colonies over time and 
found that the expansion rate of visible colonies 
did not differ between WT and mutant cells 
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(fig. S9D), suggesting that faster recovery oc- 
curred at early stages before colony appearance. 

A delay in bacterial growth after a period of 
antibiotic exposure is classically referred to as 
the post-antibiotic effect (PAE) (12). The PAE 
has been recognized for decades across many 
bacterial pathogens and is an important factor 
in the design of treatment regimens (12, 13). 
To date, the PAE remains mechanistically 
enigmatic, with recent work favoring a de- 
toxification model in which the cells must 
export residual drug before growth resumes 
(14). Our results suggest that Mtb has mecha- 
nisms for post-antibiotic recovery that are under 
genetic control, and that resR mutants optimize 
this system, accelerating bacterial recovery 
after a wide variety of antibiotic insults. We 
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Fig. 2. resR mutants show faster recovery after drug exposure. (A) MICs 


of resR mutants and wild type for eight anti-TB drugs: 
rifampicin; OFX, ofloxacin; ETH, ethionamide; EMB, eth 


D-cycloserine; BDQ, bedaquiline; LZD, linezolid. The drug-resistance breakpoint 
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refer to this phenotype as antibiotic resilience 
and thus named this gene Rv1830, resR for 
resilience regulator. 


Antibiotics resilience manifests as 
early recovery 


To better understand early events in post- 
antibiotic recovery, we used microscopy to 
investigate cell regrowth after the period 
of antibiotic exposure. Mtb grow from their 
subpolar regions, which can be quantitatively 
defined by imaging that tracks the incorpo- 
ration of fluorescently labeled D-amino acids 
(FDAAs) into the nascent peptidoglycan near 
the cell poles (15, 16). We assessed regrowth of 
wild-type and resR mutant cells after 24 hours 
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stance. (B) Time-kill Mtb strains. 
lowed by a 10- to 36-hour recovery window in 
drug-free media (Fig. 4, A and B). During the 
recovery period, 7esR mutants showed signif- 
icantly more new cell wall synthesis than the 
wild-type cells (P < 0.0001 for 24 and 36 hours; 
Fig. 4, B and C). Wild-type cells and resR mu- 
tants had similar FDAA incorporation in the 
absence of antibiotic exposure (Fig. 4B). 
This labeling approach allowed us to use 
flow cytometry to assess new cell wall synthe- 
sis rapidly and quantitatively in mutant and 
wild-type cells across a wide range of anti- 
biotic exposure conditions. We assessed re- 
growth after exposure to eight antibiotics at 
100-fold MIC. resR mutants showed increased 
cell wall synthesis after exposure to all anti- 


of isoniazid exposure at 100-fold MIC, fol- 
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kinetics for eight different drugs for resR mutant and wild-type Mtb strains. 
The concentration of each drug was 100-fold MIC. The dashed line indicates 
MDKgg. *P < 0.05 by unpaired t test. (C) Representative images illustrating the 
post-antibiotic recovery dynamics of resR mutant (R95C) and wild-type (WT) 


similar between resR mutants and the wild 
type in the absence of drug (Fig. 4D). The dif- 
ferent resR mutants all showed a similar 
phenotype (fig. S9E). resR mutants also dem- 
onstrated faster recovery after up to 120 hours 
of isoniazid exposure and across a range of 
antibiotic concentrations (fig. S10). 


A regulatory pathway underlying 
antibiotic resilience 


To delineate the regulatory targets through 
which ResR mediates antibiotic resilience, we 
used an in vitro whole-genome DNA binding 
and deep sequencing assay (IDAP-Seq) (17) 
to identify the ResR-binding sites across the 
genome. We purified ResR proteins with either 
an N-terminal or a C-terminal His tag, and 
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Fig. 3. Quantitative colony size tracking 
indicates shortened PAE in resR mutants. 
(A) Quantitative colony size tracking in 
pixels (px.) for resR mutant and wild-type 
Mtb strains in the presence or absence 

of antibiotic exposure for 24 hours (24h). 
(B) Duration from plating to the appearance 
of visible colonies (time to colony 
formation) for resR mutant and wild-type 
strains after exposure to the indicated 
antibiotics (24h) or no drug exposure 
(none). P = 0.7291 for the no-drug-exposure 
group and P < 0.0001 for all drug groups, 
as determined by Mann-Whitney U test. 

(C) Bar plot depicting the median values 

of post-antibiotic delay of wild-type 

(light green) and resR mutant strains 

(dark green). 


performed IDAP-Seq with both versions. With 
both constructs, we identified four major ResR- 
binding sites in the Mtb genome in the interge- 
nic regions between 7777J-Rv0996, nrdH-Rv3054, 
whiB2-fvbiA, and Rv3916c-parB (Fig. 5A). None 
of the genes flanking the dominant ResR bind- 
ing sites has a known role in drug resistance. 
Unexpectedly, however, our population geno- 
mic analyses found that one of them, the 
whiB2-fbiA intergenic region, is also one of 
the major targets of ongoing positive selection 
in clinical Mtb isolates (Fig. 1B). Indeed, the 
ResR-binding site in the whiB2-/biA interge- 
nic region is the site of mutant accumulation 
in clinical Mtb strains (Fig. 5B), an evolution- 
ary pattern suggesting that natural selection 
has acted on both the ResR and its binding 
site, presumably to alter the expression of 
whiB2 or fbiA. 

To assess the regulatory role of ResR, we 
used RNA sequencing to assess gene expres- 
sion changes in the three independent vesR 
mutants compared with isogenic wild-type 
Mtb. The resR mutants had ~2-fold higher 
levels of whiB2 transcript but no significant 
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alterations in the expression of the other genes 
adjacent to the major ResR-binding sites (Fig. 
5C). The changes in whiB2 expression were 
part of a pattern of altered transcription that 
also included increased expression of the 
iniBAC operon, which was previously shown 
to be expressed in response to cell wall- 
targeting antibiotics (78) and other mediators 
or regulators of central carbon metabolism 
(e.g., bkdABC and clgR) and cell wall-acting 
genes, notably pbpB (19, 20) (fig. S11A). To fur- 
ther define the regulatory relationship be- 
tween resR and whiB2, we reconstructed several 
resR variants in Mycobacterium smegmatis 
(homolog: MSMEG_3644), including CRISPR-i 
knockdown, overexpression, and point-mutant 
strains (Fig. 5D). Transcriptional profiling of 
these strains suggested that ResR functions 
as an activator of whiB2 expression, and that 
the clinical mutation T77A results in increased 
whiB2 transcription (Fig. 5D and fig. S11B). 
WhiB2 is an essential regulator that controls 
cell growth division in mycobacteria. Knockdown 
of whiB2 expression results in cell elongation 
and division defects, whereas overexpression 
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of whiB2 in Mtb results in increased cell length 
(21, 22). We next sought to determine the ef- 
fect of the whiB2-/biA mutation on whiB2 ex- 
pression and Mtb physiology. We generated 
isogenic Mzb strains carrying a clinically prev- 
alent mutation in whiB2-fbiA (3640375 C>T). 
The point-mutant strain had an ~1.6-fold in- 
crease in whiB2 expression (Fig. 5E and fig. 
$12A). Moreover, this strain also phenocopied 
resR mutants, with similar morphotypic changes 
donger and wider cells) (fig. S12B), a small in- 
crease of isoniazid MIC, and, most importantly 
(fig. S12C), faster recovery after antibiotic 
exposure (fig. S12, D and E). However, the 
magnitude of the resilience phenotype in the 
whiB2-fbiA mutant appeared to be smaller 
than that of vesR mutants, suggesting that ad- 
ditional, as yet undefined, mechanisms might 
be involved. 


resR, whiB2-fbiA, and whiA mutants are 
associated with acquisition of canonical drug 
resistance and treatment failure 


The WhiB2 regulon has been best characterized 
in Streptomyces, in which the WhiB2 homolog 
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Fig. 4. Antibiotic resilience characterized by more rapid resumption of cell wall 
synthesis. (A) Schematic diagram of a pulse-chase experiment for tracking 

the regrowth of Mtb cells after antibiotic exposure using FDAA incorporation (NADA 
and HADA). (B) Representative microscopy snapshots of the resR mutants and wild 
type during regrowth after 24 hours of INH (100-fold MIC) exposure (top) and no 


antibiotic exposure (bottom 


AlexaFluor488-H (NADA) 


). (C) Quantitative comparison of outgrowth between 


resR mutant and wild type in the post-antibiotic-exposure group and the no-antibiotic- 


exposure group; P < 0.000 
by double-sided Kolmogoro' 
tion into Mtb cells after 24 


functions with another regulator, WhiA, to 
initiate cell division (23, 24). We found in 
our population genomic analysis that whiA, 
like resR and whiB2, is also a major target of 
positive selection in clinical Mtb isolates (Fig. 
1B). To assess the clinical relevance of muta- 
tions in this regulatory triad, we compared the 
prevalence of fixed mutations of resR, whiB2- 
fbiA, and whiA in drug-susceptible strains 
versus drug-resistant strains. Of the 51,229 
genomes in our database, 6.9% had mutations 
in these genes or IGRs, and mutations in these 
genes were significantly enriched in drug- 
resistant strains (fig. S13A). We further looked at 
Mtb strains from two high-TB-burden countries, 
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India and China, where Mtb populations were 
not skewed by recent expansions of clonal Mtb 
strains, and each had >1000 sequenced iso- 
lates. Again, strains with mutations in the 
three genes or IGR had increased odds ratios 
(ORs) for drug resistance: India, OR: 1.80; 95% 
confidence interval (CT): 1.11~2.91; China, OR: 
1.71; 95% CI: 1.06~2.74 (Fig. 6A and fig. S13B). 
Therefore, these data suggest that resR, whiB2- 
SbiA, and whiA mutations facilitate the evo- 
lution of canonical drug resistance. These 
mutations remain under positive selection in 
highly drug-resistant strains, as assessed by 
within-host evolution (Fig. 6B), suggesting 


for 24 and 36 hours of post-antibiotic-exposure group 
v-Smirnov test. (D) Flow cytometry for NADA incorpora- 
hours of recovery with or without antibiotic exposure. 


to first-line drugs, they provide an advantage 
to the bacterium as it encounters second-line 
agents. Evidence of selection at every level of 
drug resistance further supports the model 
that these variants are selected by a multi- 
drug phenotype such as antibiotic resilience. 
However, the shifted isoniazid MIC is likely to 
contribute to their success in drug-susceptible 
strains, and this dual benefit may be one 
reason that they are clinically prevalent. 
Finally, although resR, whiB2-fbiA, and whiA 
variants are more frequent in drug-resistant 
strains, variations in these genes are also 
under positive selection in drug-susceptible 


that even after the emergence of resistance 


strains (fig. S13C), and the sites of mutation 
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Fig. 5. ResR activates whiB2 and clinical mutations leading to up-regulation of 
whiB2. (A) |DAP-Seq identified binding sites of ResR. (B) ResR-binding peaks 
overlaid with the mutations identified in clinical strains between whiB2 and 
fbiA. The transcriptional start sites of whiB2 and fbiA are indicated by green 
arrows. (C) Fold change of transcription of putative ResR targets in resR 
mutants and wild-type strains. P values were determined by the Wald test 
implemented in DEseq2. (D) Transcriptional changes of whiB2 in 


M. smegmatis strains: wild-type (WT), resR point mutant (177A), CRISPR-i 
knock-down of resR (KD), merodiploid overexpression of wild-type resR 
(OE) or T77A mutant form (OE-T77A). The absence or presence of aTc is 
indicated by (—) or (+). P values were determined by unpaired t test. 

(E) The whiB2-fbiA mutant (3640375 C>T) exhibited up-regulation of whiB2. 
whiB2-1 and whiB2-2 refer to the two parallel mutants. P values were 
determined by unpaired t test. 


in drug-susceptible strains mimic those in 
drug-resistant strains (fig. S13D). Indeed, 1.5 to 
9.7% of drug-susceptible Mtb strains from the 
high-TB-burden countries had fixed mutations 
in one of these three genes or IGRs (fig. SI3E). 
We postulated that in addition to being clin- 
ically important as stepping-stone mutations 
for the emergence of canonical drug resis- 
tance, these variants independently contrib- 
ute to treatment failure in drug-susceptible 
patients. To test this hypothesis, we rean- 
alyzed data from the REMoxTB trial, a global 
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phase 3 clinical trial that sought to reduce 
the treatment duration of drug-susceptible TB 
(25, 26). Of the 1931 patients enrolled in the 
trial, paired Mtb isolates from 36 recurrent TB 
patients were whole-genome sequenced and 
the isolates remained drug sensitive (26). We 
found that eight of 36 (22.2%) isolates from 
patients who failed treatment had resR, whiB2- 
SviA, or whiA mutations (Fig. 6C), a frequency 
significantly higher than found in the back- 
ground populations of South Africa (108/2998, 
3.6%) and Thailand (110/2036, 5.4%), where 


the trial was conducted (P = 0.0024 and P = 
0.0152, respectively, Fisher’s exact test; Fig. 6D). 
Additionally, the within-host frequency of 
these variants increased between the initial 
and recurrent episodes, in three cases rising to 
fixation (Fig. 6E). 

We have identified a new form of altered 
drug susceptibility that we call antibiotic re- 
silience, which is regulated by the gene resR. 
Mutations in the regulating cascade are positively 
selected in clinical Mtb isolates and associated 
with poor treatment outcomes in TB. Current 
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Fig. 6. resR, whiB2-fbiA, and whiA mutants are associated with canonical 
drug resistance and relapse of drug-susceptible TB. (A) Proportion of 

resR, whiB2-fbiA, and whiA mutants in DR and DS strains sequenced from 
India and China. P values were determined by Fisher's exact test. (B) Unfixed 
mutations in DS, DR, and MDR (resistant to RIF and INH) strains. 

(C) Phylogenetic tree of paired Mtb isolates from 36 recurrent TB patients. 
Solid stars indicate isolates in which mutations were detected, and empty stars 


clinical measures of antibiotic susceptibility 
focus largely on the magnitude of drug expo- 
sure; that is, what concentration of drug the 
pathogen can experience and grow or at least 
survive (27). However, our results suggest that 


in the clinical setting, Mtb is evolving to change (2016). 
: A 4. H. Safi et al., Proc. Natl. Acad. Sci. U.S.A. 116, 19665-19674 
the temporal dynamics of its recovery after (2019), 20. 


drug exposure. The temporal dynamics of drug 
responses are considered when developing new 
drugs and drug regimens but are often forgot- 
ten as potential drivers of treatment outcomes 
that might be influenced by pathogen varia- 
tion. In studies of Escherichia coli experimen- 
tally evolved under intermittent drug exposure, 
temporal variants were the first to emerge (28). 
Further, the mutants had delayed regrowth 
rather than faster regrowth, suggesting that 
the temporal phenotypes could vary by the 
pathogen, drug regimen, and host factors such 
as drug metabolism (29-37). Because current 
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CRYSTALLIZATION 


Liquid metal synthesis solvents for metallic crystals 


Shuhada A. Idrus-Saidi’+, Jianbo Tang’*+, Stephanie Lambie’, Jialuo Han’, Mohannad Mayyas?, 
Mohammad B. Ghasemian’, Francois-Marie Allioux', Shengxiang Cai’, Pramod Koshy°, 
Peyman Mostaghimi‘, Krista G. Steenbergen®, Amanda S. Barnard®, Torben Daeneke”*, 


Nicola Gaston“, Kourosh Kalantar-Zadeh'®* 


In nature, snowflake ice crystals arrange themselves into diverse symmetrical six-sided structures. 

We show an analogy of this when zinc (Zn) dissolves and crystallizes in liquid gallium (Ga). The low- 
melting-temperature Ga is used as a “metallic solvent” to synthesize a range of flake-like Zn crystals. 
We extract these metallic crystals from the liquid metal solvent by reducing its surface tension using 
a combination of electrocapillary modulation and vacuum filtration. The liquid metal-grown crystals 
feature high morphological diversity and persistent symmetry. The concept is expanded to other 
single and binary metal solutes and Ga-based solvents, with the growth mechanisms elucidated through 
ab initio simulation of interfacial stability. This strategy offers general routes for creating highly 
crystalline, shape-controlled metallic or multimetallic fine structures from liquid metal solvents. 


erhaps the most captivating example 
of crystal growth is the formation of 
six-branched ice crystals (snowflakes) 
(7, 2). Although they are distinct from 
one another, snowflakes constitute high- 
ly symmetrical six-branched dendrites, plates, 
prisms, and needle structures. The establish- 
ment of snowflake crystals relies on the orderly 
buildup of water molecules from an oversatu- 
rated gas phase onto the crystal facets domi- 
nated by molecular interactions. The synthesis 
of artificial snowflakes has allowed a phenom- 
enological understanding of the crystallization 
process (1, 2). A snowflake’s sixfold symmetry 
stems from the hexagonal arrangement of the 
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ice lattice as well as ambient growth conditions, 
including temperature and humidity, which 
in turn determine the crystals’ morphologies 
(3). The morphological evolution of snowflake 
crystals is just one example of the highly com- 
plex crystal growth behaviors found in many 
natural and synthetic systems (4-7). 

Here, extending the analogy of ice snowflakes 
to metals, we show that metallic snowflakes 
can be grown in metallurgical processes using 
liquid metals. The features that distinguish a 
liquid metal solvent from its nonmetallic coun- 
terparts are intuitive, as it is a dense and strongly 
interacting liquid that can solvate metal atoms 
in their neutral state, thus avoiding any need 
for the reduction of a metal precursor. In prin- 
ciple, the relatively isotropic nature of bonding 
in metals would not suggest structural aniso- 
tropy induced by the liquid metal solvent. How- 
ever, in the case of a low-melting-temperature 
p-block post-transition metal such as gallium 
(Ga), the tendency for lower-density struc- 
tures to form in the solid state and the fact 
that covalent character coexists with metallic 
bonding make the assumption of isotropic in- 
teractions overly simplistic (8). Given the con- 
siderable evidence of local structuring within 
liquid Ga at low temperatures (9-11), we hy- 
pothesize that liquid Ga can preferentially 
direct the growth of solute metal structures 
through a preference for either symmetry or 
orientation during crystal growth after nucle- 
ation. Consequently, rich crystal formation dy- 


namics and diversified crystal morphologies 
are expected to emerge from liquid alloy sys- 
tems. However, extracting the synthesized 
crystals from the liquid metal solvent while 
preserving their microscopic fine features and 
revealing their exposed crystal facets poses a 
considerable challenge. 

The emergence of Ga and Ga-based liquid 
alloys offers viable possibilities for growing 
metallic crystals. Given that, at moderate tem- 
peratures, most metals in the periodic table of 
elements have a certain solubility in Ga and its 
alloys (72-14), a great variety of crystals can po- 
tentially be synthesized via the proposed route. 
This route can also be extended to some semi- 
metals, such as silicon, but not to most non- 
metals, which have limited solubility in liquid 
metals at low temperatures. Although this con- 
cept can lead to great opportunities, accessing 
the synthesized crystals grown inside the liquid 
metal solvent remains a hurdle. This challenge 
may be related to the affinity (wettability) be- 
tween the crystals and metallic solvents. Un- 
like conventional synthesis, Ga-based liquid 
metals are not volatile, and as such, their high 
surface tension needs to be broken to isolate 
the generated crystals from the solvent (15). 
Thus, conventional physical sieving and fil- 
tering that work for separating synthesized 
particles from aqueous and organic solvents 
cannot be applied to liquid metals. Given these 
challenges, the possibility of extracting sub- 
micrometer and nanodimensional crystals 
from inside a liquid metal reaction medium 
has been largely ignored. We hypothesize that 
one effective way to access such nanostruc- 
tures is to direct the generated crystals inside 
a liquid metal toward its surface and then 
reduce its surface tension by electrocapillary 
modulation (J6, 17). 

In this study, we explored the morphology 
and composition selection of the crystal for- 
mation process of metals dissolved in liquid Ga 
as a solvent. We demonstrated that the metal 
crystals can be effectively extracted from the 
liquid Ga solvent by combining electrocapil- 
lary modulation and vacuum filtration. We 
used the binary zinc-gallium (Zn-Ga) system 
as the main model that gives rise to various 
types of highly symmetrical Zn structures, 
resembling that of the six-branched snow- 
flake crystals. The influence of solute concen- 
tration, crystal growth time, temperature, and 
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Fig. 1. Schematic presentation of the crystal formation and extraction 
strategy. (A) Experimental procedures to grow metal crystals in a liquid metal 
solvent. Solute metal, solvent metal, and the parameters (including temperature 
T, concentration C, pressure P, and time t) that determine the crystal structure 
and size are indicated. (B) Separating crystals from the liquid metal using an 
electrical field together with vacuum filtration. (©) Examples of various types of 


pressure were investigated for Zn-Ga systems 
and related to snowflake-like crystal morphol- 
ogies. Subsequently, we tested other solute 
metals of tin (Sn), bismuth (Bi), silver (Ag), 
manganese (Mn), nickel (Ni), copper (Cu), 
platinum (Pt), the solute metal couple of Mn 
and aluminum (Al), and the solvent effect, to 
demonstrate the morphological and compo- 
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sitional diversity of crystals and showcase 
possible applications. 

We synthesized a large variety of metallic 
crystals featuring distinct crystalline shapes 
using Ga liquid metal as the reaction medium 
(Fig. 1). Solute metals were first dissolved in 
Ga at an elevated temperature (7};) (Fig. 1A 
and table S1). After homogeneous liquid alloys 


ge. 
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crystals featuring different morphologies and compositions. (D and E) Top-view 
snapshot structures of (D) the Zn(0001) facet and (E) the Zn(1010) facet 
interfaced with liquid Ga. Zn atoms are in dark gray, and the underlying Ga atoms 
are in green. (F) Radial distribution functional and angular distribution functional 
analyses for the Zn(O001) and Zn(1010) surface terminations when interfaced 
with liquid Ga. a.u., arbitrary units. 


were formed, we cooled them to room tem- 
perature (77) to allow the solute metals to 
become oversaturated and precipitate out of 
the liquid metal solvent in crystalline forms, 
while the Ga bulk remained in its liquid state 
owing to supercooling (18). We first attempted 
to filter the metallic colloidal solution through 
a porous nylon filter membrane (pore size: 
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20 um). However, the high surface tension 
of liquid Ga prevented the separation of the 
metallic crystal precipitates from the liquid 
metal solvent. To break the surface tension, 
we applied a potential of +5 V to the liquid 
metal in NaOH solution (1 mol liter™), while 
simultaneously applying vacuum-assisted fil- 
tration (Fig. 1B and fig. S1). Electrochemical 
actuation has been shown to enable the tuning 
of a liquid metal’s surface tension from its nat- 
ural value to near zero (19-21). After applying 
the voltage, the liquid metal flattened and 
expanded instantly because of the drastic de- 
crease in its surface tension (22). After reduc- 
tion of the surface tension, the liquid Ga was 
able to pass through the filter together with the 
NaOH solution. Our measurements indicated 
that nearly the entire volume of liquid Ga could 
pass through the porous membrane, leaving 
behind crystals topped with traces of Ga. After 
an additional washing step using NaOH to 
remove the remaining Ga in the filtered resi- 
dues, we obtained clean metallic crystals (Fig. 
1C). Depending on the choice of metallic solute, 
the intrinsic crystal lattice structure, and crys- 
tallization dynamics, we could produce intricate 
and distinct crystal morphologies by tuning the 
growth conditions. 

To demonstrate the distinguishing features 
of a strongly interacting liquid metal synthe- 
sis solvent and to test our hypothesis on the 
solvent’s role in directing anisotropic crystal 
growth, we performed ab initio molecular 
dynamics (AIMD) simulations to calculate 
the interfacial energies for the interaction 
between the low-index (0001) facet and (1010) 
facet (Bravais-Miller index) of hexagonal 
Zn crystals and a Ga liquid, Ga(1) (23). Given 
that a larger magnitude interfacial energy 
indicates a stronger interaction between 
the interfaces, the results demonstrate that 
the Zn(1010)/Ga(]) interface interacts more 
strongly (-8.290 eV nm ”) than the Zn(0001)/ 
Ga(1) interface (-6.381 eV nm”) (table $2). 
Because of the increased strength of inter- 
action between Zn(1010) and Ga(1) [i.e., the 
increased effect of the Ga() surface disorder 
on the Zn crystal structure], the structural 
ordering within the interfaced Zn(1010) slab is 
reduced compared with that of the Zn(0001) 
crystallographic orientation (Fig. 1, D and E, 
and fig. S2). By comparison, in the more 
weakly interacting Zn(0001)/Ga(1) interface, 
the Zn(0001) slab has a more clearly defined 
structure compared with the interfaced Zn 
(1010), as reflected by the more well-defined 
peaks (narrower peaks and/or lower minima 
between peaks) in both radial distribution 
functional analysis and angular distribution 
functional analysis (Fig. 1F). This result agrees 
nicely with the surface energies for Zn, which 
indicate that the Zn(0001) facet is more sta- 
ble than the Zn(1010) facet (24). Therefore, 
Zn(0001) faceting is expected to be favored 
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during the growth of Zn crystals in liquid Ga, 
as Zn(1010) is destabilized by its stronger 
and more structurally disruptive interfacial 
interaction. 

The preferred Zn(0001) faceting predicted 
by the AIMD simulations is validated by our 
experimental observations of Zn crystal growth 
in liquid Ga. When Zn is used as the solute, 
the crystals formed in the liquid Ga strongly 
resemble snowflake morphologies, particu- 
larly when considering the highly varied crystal 
types obtained from a single Zn,-Ga, system 
(@ wt % of Zn and y wt % of Ga). The Zn crys- 
tals extracted from the same Zn; Gago sample 
have different structures, as shown in the scan- 
ning electron microscopy (SEM) images in 
Fig. 2, A to C, and varying the growth condi- 
tions (time, temperature, and pressure) gives 
rise to substantial structural changes (Fig. 2, 
A to G). At 10%, the Zn content is above its 
solubility limit of 3.64 wt % in liquid Ga at 
room temperature. The observations suggest 
that increasing the growth duration, in gen- 
eral, leads to larger crystals (Fig. 2F). Despite 
their distinct shapes, all the crystals feature 
sixfold symmetry, which stems from the hex- 
agonal lattice structure of Zn, as shown in our 
x-ray diffractogram (Fig. 2H). In addition to 
the AIMD simulations, we validated the for- 
mation of hexagonal Zn crystals on the basis 
of the thermodynamic properties of the Zn-Ga 
system using FACTSage 8.0 (fig. S3). From the 
x-ray diffraction (XRD) and energy-dispersive 
spectroscopy (EDS) maps (Fig. 2, H and I), the 
extracted crystals are clean and consist mostly 
of Zn and negligible amounts of Ga. The pris- 
tine Zn crystals are naturally oxidized on the 
surface when exposed to ambient air after ex- 
traction (fig. S4). 

The influences of the control parameters 
and solute-solvent energetics and interactions 
on the obtained crystals can also be interpreted 
by the classical theory of crystal formation 
(25). Multiple parameters, including temper- 
ature, concentration, pressure, and growth 
orientation (facet), can affect the formed crys- 
tals substantially through different pathways 
(23). For the Zn-Ga system, hexagonal Zn nuclei 
form first during cooling. The Zn crystals we 
obtained are grown by the gradual attachment 
of Zn atoms from the oversaturated liquid 
metal solvent to different facets. As evidenced 
by our simulations, the Zn(0001) basal facet 
(Fig. 2H) with well-defined structure and order 
is preferentially preserved in interaction with 
liquid Ga during crystal growth, leading to the 
observation of platelike snowflake structures 
(Fig. 2, A to E) and the substantially higher 
(002) XRD peak in comparison to the (100) 
peak (Miller index) (Fig. 2H and fig. S5) (23). 
Apart from that, effects such as secondary nu- 
cleation, facet instability, and difference in the 
growth rate on different prism facets [induced 
by the changes in repeatable step sites for 


atom attachment and other growth conditions 
(26, 27)] further lead to more complex crystal 
structures, such as fractals and dendrimers, 
rather than simple hexagonal plates. In addi- 
tion, splitting of crystal tips (corners), branched 
growth, and jagged edges resembling the Koch 
curve (28) are frequently observed in these 
crystals. 

Among all the influential variables, increas- 
ing the temperature at which the alloy was 
prepared to 550°C (Fig. 2D) generated a broad 
range of Zn crystal sizes that are typically 
larger than that of the 350°C case (Fig. 2A). 
At this higher temperature, the occurrence of 
12-branched Zn crystals becomes frequent, 
coexisting with much smaller six-branched 
ones (Fig. 2D and fig. S6). As in the formation 
of 12-branched snow crystals (29), 12-branched 
Zn crystals are considered as the concurrent 
growth of two overlapping hexagonal crystal 
seeds. Therefore, the emergence of 12-branched 
Zn crystals at higher initial temperature means 
that, apart from growth rate, temperature 
also affects the growth regime of the crystals 
owing to changes in their thermodynamic 
growth path. Zn crystals grown at elevated 
pressure (5 bar) feature simple fractal shapes, 
which are also larger in size than those grown 
under ambient pressure (Fig. 2E), presum- 
ably as a result of facilitated nucleation due 
to a decrease in surface tension of liquid 
Ga (30). 

The influence of growth time and solute 
concentrations on the morphology and size 
of the Zn crystals is further investigated in 
detail, which allows us to construct a morphol- 
ogy diagram for the Zn-Ga system (Fig. 3A). 
We show examples of extracted crystals from 
different Zn concentrations for day 2 in Fig. 3B 
and other growth times in figs. S7 to S9 (see 
also table S3). We summarize the characteris- 
tic shapes and relative sizes of Zn crystals 
grown at day 1, day 2, and day 10 across the 
Zn concentration range from 5 to 20 wt % 
(350°C initial temperature and ambient pres- 
sure) in Fig. 3A. The transition in crystal mor- 
phologies is determined by the tendency of 
anisotropic growth (31), with the hexagonal 
plates and the dendrimers having the lowest 
and highest anisotropies, respectively. 

The morphology diagram reveals the sig- 
nature sixfold symmetry of all the Zn crystals, 
despite their highly intricate shapes, which is 
a clear indication of the hexagonal Zn lattice 
structures and the Zn(0001) faceting revealed by 
our AIMD simulations. The formation of den- 
drimers is more favorable than simple fractals 
and plates at low Zn concentrations (Zn;Gags 
and Zn, 9Gago, except for Zn;Gag; at day 10), 
whereas high Zn concentration (Zng Gago) 
predominantly leads to a combination of frac- 
tals and plates. In addition, we observed at high 
Zn concentration more fractals than hexagonal 
plates (fig. S9). We saw the growth of the Zn 
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Fig. 2. Characterizations of Zn crystals extracted from liquid ZnjgGago 
alloy. (A to C) SEM images of Zn crystals obtained after (A) 1 day, (B) 2 days, 
and (C) 10 days of growth time at 350°C initial temperature. (D) SEM image 
of Zn crystals after 1 day of growth at 550°C initial temperature. (E) SEM image 
of Zn crystals formed after 1 day at 5 bar. (F) Size distribution of the crystals 


crystals over time for the group with 10 wt % 
Zn. However, we did not find a size dependence 
on growth time of the crystals for the Zn;Gags 
and Zn Gago groups. Much larger dendrimers 
are obtained when the Zn concentration 
is further increased to 30 wt % after a day 
of growth time (fig. S10). The switching of 
growth regimes from dendrimers (Zn;Gags 
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and Zn, Gago) to fractal and/or hexagonal 
plates (Zng Gago) and then back to dendrimers 
(ZnNz9Ga7o) is reminiscent of the snow crystal 
morphology diagram (2). The observed trends 
suggest that metal crystal growth in a metallic 
liquid shares its highly dynamic nature with 
nonmetallic systems and is sensitive to the 
prevailing growth conditions. We used a ma- 
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35 40 45 50 55 60 65 70 75 
20 (degree) 


under different experimental conditions in (A) to (E). (G) The ratios of different 
crystal morphologies (dendrimers, fractals, and plates) in the extracted crystals 

of (A) to (E). (H) XRD patterns of the extracted crystals showing a single phase of 
Zn (Miller index). (I) Energy-dispersive x-ray spectroscopy elemental mapping 
showing the distribution of the Zn, Ga, and O. 


chine learning model to confirm the sensitiv- 
ity of the crystal structures (figs. S11 and S12). 
X-ray imaging (fig. S13) and SEM and EDS 
(fig. S14) were also performed to observe the 
crystal dimensions and morphologies at dif- 
ferent growth times. 

We further investigated the solvent effect by 
changing the liquid metal solvent from Ga to 
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Fig. 3. Morphology diagram of the Zn crystals. (A) A morphology diagram drawn on the basis of the types of Zn crystals as a function of crystal growth time 
and Zn concentration under the same temperature (350°C) and pressure (atmospheric pressure) conditions. (B) Examples of Zn crystals with different Zn 
compositions after 2 days of growth time. Scale bars: 10 um. See examples for different growth times in figs. S7 to S9. 


Ga-indium (In) alloys for Zn crystal growth, 
which was found to generate the same crystal 
type (hexagonal Zn) but strengthen the (001) 
faceting (figs. S15 and S16). This is attributed 
to the decreased activation energy of the solute 
metal during nucleation as well as the reduced 
concentration barrier during crystal growth 
as a result of adding In to the Ga solvent (23), 
thereby leading to larger crystals at higher In 
ratios (fig. S17). 

As a distinctive metallic solvent, liquid Ga 
dissolves most metals in the periodic table, 
albeit to different extents. Therefore, the 
method can be extended to many other solute 
metals for growing and extracting metallic 
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crystals. We demonstrated this with the binary 
systems Snz9Ga7o, BigGagg, AS2Gags, MnoGags, 
NipGagg, CugGagg, and PtzGags (Fig. 4, A to 
G) and with the ternary Al;Mn;Gago system 
(Fig. 4H). We chose the concentrations mainly 
according to their respective phase diagrams 
and solubility. 

We obtained crystals of different morphol- 
ogies and compositions depending on their 
intrinsic crystal structure and the influence of 
growth conditions on each individual system. 
The crystals grown in the binary systems can 
be classified into two categories. This first 
category includes Sn-Ga, Bi-Ga, and the Zn- 
Ga systems in which the solute metal phase 


completely separates from the liquid Ga phase 
during crystallization (no intermetallic phase), 
leading to single-metal crystals (Fig. 4, A and B). 
Predominantly nonfaceted dendritic growth 
of tetragonal Sn crystals reaching hundreds 
of micrometers in size is observed in the Sn- 
Ga system, whereas faceted growth of cubic 
Bi crystals emerges in the Bi-Ga system, gen- 
erating exotic hollow hopper crystals that reach 
several micrometers in size. The other cate- 
gory, including Ag-Ga, Mn-Ga, Ni-Ga, Cu-Ga, 
and Pt-Ga, is distinguished by the formation of 
intermetallic compound phases between the 
solute metals and the solvent Ga. Both hexago- 
nal Ag,Ga and tetragonal MnGa intermetallic 
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Sn-Ga 


x= Pt-Ga fp) Cu-Ga 7 Ni-Ga m Mn-Ga o Ag-Ga oO Bi-Ga 


Al-Mn-Ga 


crystals grow into high-aspect-ratio rod (or 
wire) structures with diameters that are typi- 
cally less than 10 um (Fig. 4, C and D), imply- 
ing highly faceted and anisotropic growth of 
the two. The Ni-Ga, Cu-Ga, and Pt-Ga sys- 
tems produce cubes of cubic NiGa, (Fig. 4E), 
plates of tetragonal CuGay, (Fig. 4F), and edge 
and corner truncated tetragonal plates of 
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Fig. 4. Crystals formed and extracted 
from metal Ga systems other than 
Zn-Ga. (A to G) SEM images, XRD patterns 
and size distributions of the extracted 
crystals from the binary systems (A) 
Sn-Ga, (B) Bi-Ga, (C) Ag-Ga, (D) Mn-Ga, 
(E) Ni-Ga, (F) Cu-Ga, and (G) Pt-Ga 
(image on the right side shows 

both platelike and rodlike structures). 

(H) SEM image (right), EDS element 
mapping showing the distributions of Al 
and Mn, XRD pattern, and size distribution 
from a ternary Al-Mn-Ga alloy system. The 
growth conditions of all crystals are set to 
1-day growth time, 350°C initial temper- 
ature, and atmospheric pressure. The 
oxide phases identified in the XRD of (A), 
(C), and (D) are formed after the crystals 
are extracted from liquid Ga and are not 
discussed. The size distributions are 
surveyed according to their (A) length of 
dendrites, [(B) and (G)] side length of 

the crystals, [(C) and (D)] thickness 

of the rod structures, (E) side length of 
the cubic structures, (F) diagonal length of 
the plates, and (H) thickness of the crystals. 
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cubic Ga,Pt crystals, together with larger rod- 
like and dendritic structures (Fig. 4G). Apart 
from the intermetallic phases, these three sys- 
tems also show evidence of single-metal crystals 
of the solute metals. The elemental distribu- 
tions of the metals in the crystals can be found 
in the EDS maps in fig. S18. The crystal phases 
we observed can be traced back to their phase 
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diagram at the relevant concentration and tem- 
perature ranges (fig. S19). 

Simultaneously introducing the two solute 
metals Al and Mn into liquid Ga leads to a 
ternary system. The crystal growth dynam- 
ics in a ternary system are expected to have 
greater complexity simply by virtue of the 
three-pair interactions between the metal 
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atoms. In comparison, a binary system has only 
one interacting metal atom pair. Surprisingly, 
as evidenced by the EDS elemental mapping 
and XRD data, the Al-Mn-Ga system produces 
a binary Al,;Mn, intermetallic phase that ex- 
clusively consists of the two solute metals (Fig. 
4H). Given that the MnGa intermetallic phase is 
found to form in the binary Mn-Ga system 
(Fig. 4D), further introduction of Al must alter 
the energetics of crystal formation in the system. 
The Al,,Mn, has relatively low Gibbs energy of 
formation (32), and consequently, the emergence 
of Al,;,;Mn, under the current conditions should 
become energetically favored in comparison to 
any other possible intermetallic phases (fig. S20). 
The possibility, demonstrated here, of growing 
compound crystals through our simple yet ef- 
ficient liquid metal solvent growth method 
greatly expands the accessible crystal library. 

We further applied AIMD simulations to the 
Ag-Ga system to validate the role of interfacial 
energy and structure in directing the growth 
direction of liquid metal-grown crystals. Where- 
as the AgGa crystals grown from the Ag-Ga 
system share the same hexagonal lattice struc- 
ture with the Zn crystals from the Zn-Ga system, 
Ag»Ga forms long hexagonal rodlike structures 
(Fig. 4C) that are starkly different from the 
platelike hexagonal Zn flakes (Figs. 2 and 3). 
Such a distinct structural development of crys- 
tals grown from the two systems is indeed 
reflected in our simulations. The AgyGa(0001)/ 
Ga(1) interfacial interaction is increased (i.e., 
stronger) compared with that of the AgoGa 
(1010) /Ga() interface (table S2). Correspond- 
ingly, the Ag,Ga(0001) termination, which is 
more strongly interacting with liquid Ga, is 
found to be less structured than the Ag,Ga 
(1010) surface termination (fig. S21). Using the 
same arguments as outlined for the Zn-Ga 
system, we determine that the reduced inter- 
action between the Ag,Ga(1010) termination 
and liquid Ga allows a greater degree of struc- 
tural integrity to be maintained in this surface 
termination than in the Ag,Ga(0001) termi- 
nation. There will be less disruption of the 
Ag Ga(1010) faceting by the disordered Ga(1), 
leading to the rod structures. The Sn-Ga sys- 
tem that showed dendritic (nonfaceted) Sn 
crystal growth in experiments was addition- 
ally investigated through AIMD simulations of 
interfacial energies. In this case, the Sn(200)/ 
Ga(1) interface was found to be structurally 
stable, whereas the Sn(211)/Ga(1) interface 
was not—in essence, with the strength of the 
interfacial interaction being sufficient to fully 
disorder the Sn in this orientation (table S2). 
This strong destabilization is consistent with 
the dendritic growth of the Sn crystals observed 
in experiment (Fig. 4A). The comparison of 
the AIMD interfacial energetics and the struc- 
tural characteristics of these selected systems 
explains the experimentally observed distinct 
crystal morphologies. 
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Here, we presented the synthesis of a variety 
of single metal and intermetallic crystals of 
different structures using liquid metals by 
careful selection of the solute and solvent 
metals, as well as by regulating the parameters 
that influence the growth dynamics. Particu- 
larly, it was possible to either adjust the growth 
dynamics without changing the crystal type 
(compare the Zn-Ga system in Figs. 2 and 3 
and Zn-In-Ga system in figs. S15 to S17) or alter 
the generated crystals by introducing a suit- 
able element (compare the Mn-Ga system 
in Fig. 4D and Mn-Al-Ga system in Fig. 4H). 
Finally, the obtained materials, with facet- 
dependent response to external fields or 
stimuli, enabled a range of applications for 
which proof-of-concept examples in gas sensing, 
piezoelectricity, and photocatalysis were dem- 
onstrated (figs. S22 to S24) (23). 

We demonstrated that a variety of single 
metallic or bimetallic crystals can be grown 
in a liquid metal solvent to successfully extract 
crystals with fine features from the solvent 
using a combination of electrocapillary mod- 
ulation and vacuum filtration sieving. The in- 
fluences of solute concentration, solvent type, 
growth time, temperature, pressure, and the 
lattice structure of the nuclei on crystal growth 
were investigated to understand the growth 
dynamics. The morphology and size of the 
isolated crystals were rationalized by consid- 
ering the unit crystal structure, effects of facet 
selecting and anisotropic growth, and growth 
rate. We presented the uncanny similarity of 
Zn metal flakes to ice snowflakes. In agree- 
ment with experimental observations, our 
simulations suggest that for a solid crystal of 
defined shape to grow within a liquid Ga sol- 
vent, at the nanoscale, it should be able to 
preserve its structure despite the presence of a 
strongly interacting liquid metal environment. 
The findings advance the understanding of 
crystal growth using liquid metals as the sol- 
vent and present a viable method for extract- 
ing these crystals. The use of liquid metals 
offers a previously unexplored pathway toward 
the growth of metallic nanostructures that 
does not rely on reactive soluble precursors, 
enabling unimpeded growth and leading to 
the formation of complex solidification pat- 
terns. By demonstrating both the crystal growth 
and extraction methods of metallic crystals, 
our work advances the field of functional 
nanoparticle synthesis while also offering 
opportunities in additive manufacturing 
and metallurgy for future manufacturing. 
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PIEZOELECTRICS 


Giant electric field-induced strain 


in lead-free piezoceramics 


Geng Huangfu'}, Kun Zeng?*+, Binquan Wang’, Jie Wang’, Zhengqian Fu2*, Fangfang Xu2°, 
Shujun Zhang**, Haosu Luo®, Dwight Viehland®, Yiping Guo’* 


Piezoelectric actuators are indispensable over a wide range of industries for their fast response and 
precise displacement. Most commercial piezoelectric actuators contain lead, posing environmental 
challenges. We show that a giant strain (1.05%) and a large-signal piezoelectric strain coefficient 
(2100 picometer/volt) are achieved in strontium (Sr)-doped (K,Na)NbO3 lead-free piezoceramics, 
being synthesized by the conventional solid-state reaction method without any post treatment. The 
underlying mechanism responsible for the ultrahigh electrostrain is the interaction between defect 
dipoles and domain switching. The fatigue resistance, thermal stability, and strain value (0.25%) at 
20 kilovolt/centimeter are comparable with or better than those of commercial Pb(Zr,Ti)O3-based 
ceramics, showing great potential for practical applications. This material may provide a lead-free 
alternative with a simple composition for piezoelectric actuators and a paradigm for the design of high- 


performance piezoelectrics. 


iezoelectric actuators can directly con- 

vert electrical signal into mechanical 

strain and have been widely applied in 

consumer electronic products, transpor- 

tation, precise optical instruments, and 
microelectromechanical systems and robotics, 
to name a few (J, 2). The world market for 
piezoelectric devices, in which the piezoelec- 
tric actuator segment holds the major share, 
is expected to grow up to US$35.4 billion by 
2026.(3). For lead-free devices especially, the 
compound annual growth rate during 2019 
to 2024 is expected to be as high as 20.8% (4). 
The highest electric field-induced strain among 
piezoelectric ceramics is 1.3%, reported in the 
lead-containing material (Bi,La)FeO3-PbTiOs, 
which is driven by a high electric field of 
80 kV/cm (5). Considering environmental reg- 
ulations against the use of lead, substantial 
efforts have been made to search for lead-free 
alternatives (6-8). Among the lead-free piezo- 
electric systems, (Nay2Biy/2)TiO; (NBT)-based 
ceramics possess the highest strain of ~0.7% at 
50 kV/cm, but this is accompanied by large 
hysteresis, which is attributed to the transition 
between the ergodic relaxor and ferroelectric 
phases, driven by the high electric field (9-72). 
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Thus, the normalized strain Siax/Emax (equiv- 
alent to the average large signal piezoelectric 
strain coefficient d3,) is often below 1000 pm/V, 
whereas the strain at low electric field is very 
small in NBT-based ceramics (9, 17). By con- 
trast, (K,Na)NbO3 (KNN)-based piezoelectric 
ceramics have a comparable small signal piezo- 
electric coefficient d33 to commercially used 
Pb(Zr,Ti)O3 (PZT) ceramics (6, 13). However, 
current KNN-based ceramics generally need 
complex stoichiometric control and have elec- 
trostrains only up to 0.3%, even when driven 
by electric fields of 40 to 60 kV/cm (22, 14, 15). 
Considering practical actuation applications, 
the common driving electric field is on the 
order of 20 kV/cm for commercially used PZT- 
based ceramics (/6). Lead-free materials have 
not exhibited electrostrain properties (includ- 
ing strain value, hysteresis, thermal stability, 
and fatigue resistance) at driving electric field 
of 20 kV/cm comparable with those of com- 
mercially available PZT materials. 

The introduction of defect dipoles, focus- 
ing on B — Vg (where B refers to the dopant- 
occupying B-sites of ABO; perovskite, such 
as Mn, Cu, or Fe), was reported to be an 
effective strategy to enhance electric field- 
induced strain (17-20). For example, the defect 
dipoles in aged BaTiO; single crystal, where 
Fe’, — V;, defect dipoles locally align along 
the direction of spontaneous polarization, can 
provide a restoring force to regulate revers- 
ible domain switching, resulting in pinned 
polarization-electric field (P-E) hysteresis loops 
and high strain with large hysteresis (/, 18). In 
Mn-, Cu-, or Fe-doped KNN ceramics, defect 
dipoles cause asymmetric strain-electric field 
(S-E) curves (17, 19). Although high bipolar 
strain of 0.4 to 0.5% (driven by 50 kV/cm) 
and unipolar strain of 0.18% (driven by 35 kV/ 
cm) have been achieved in Cu- or Fe-doped 
KNN ceramics, Ar atmosphere-assisted sinter- 


ing and poling-aging procedure are essential 
(19). Thus, lead-free ceramics with simple com- 
positions, low cost, facile processing, and high 
electrostrain performance are highly desired. 
We introduce Vz, /na ~ Vo defect dipoles to 
achieve excellent strain performance in the 
simple composition [Ko 5¢-2)Nao.5-2)Sl'z]NbO3 
(KNSN1002) by means of the solid-state reac- 
tion method without any special treatments 
such as poling-aging (27). Giant bipolar electro- 
strains of 1.05% at room-temperature and 
1.67% at 160°C driven by 50 kV/cm are ob- 
tained in KNSN3 (@ = 0.03) ceramics, which 
are very high values for lead-free ceramics and 
comparable to those of lead-free or lead-based 
single crystals (1, 22, 23). For practical appli- 
cation scenarios, KNSN3 ceramics also possess 
higher unipolar strain (0.25% @ 20 kV/cm) 
than any other lead-free ceramics or com- 
mercially available PZT-based ceramics. Of 
particular interest is that KNSN3 ceramics 
exhibit good fatigue resistance, low hysteresis, 
and thermal stability, showing great potential 
to replace lead-based piezoelectric ceramics 
for actuator applications. The microstructural 
and electrical characterizations show that their 
excellent strain performance originates from a 
one-way-trip alignment of Vz /Na ~ Vo defect 
dipoles driven by external electric field and 
the interaction between defect dipoles and 
domain switching. 

We used x-ray diffraction (XRD) patterns 
for KNSN ceramics to investigate the com- 
position dependence of the crystal structure 
(Fig. 1A), where the crystallographic indexing 
refers to the pseudocubic unit cell. Clearly, all 
KNSN ceramics have a pure perovskite struc- 
ture, implying that the Sr ions have diffused 
into the KNN lattice. The enlarged view of 
{200} Bragg peaks indicates that the Sr doping 
has a substantial effect on average symmetry. 
According to the peak profiles, KNSN2 mainly 
presents an orthorhombic phase, whereas 
KNSN6 mainly presents a cubic-like phase. 
The Rietveld refinement results (fig. Sl and 
table S1) indicate multiphase coexistence in 
KNSN ceramics. As Sr content increases, the 
fraction of orthorhombic phase decreases, 
whereas the cubic-like phase increases; addi- 
tionally, the tetragonal phase is found in KNSN3, 
KNSN4, and KNSN6 ceramics, indicating 
that Sr can enhance average lattice symmetry. 
The XRD results agree with the temperature- 
dependent dielectric measurements (Fig. 1B), 
in which the peaks of characteristic tempera- 
tures for the orthorhombic-etragonal transi- 
tion (To.r) and the tetragonal-cubic transition 
(Tc) are relatively sharp in KNSN2 ceramics, 
broaden substantially in KNSN3 and KNSN4, 
and nearly disappear in KNSN6 ceramics. 

We measured P-E hysteresis loops and cor- 
responding current-electric field U-E) curves 
for KNSN samples (@ = 0.02 to 0.06, where x 
represents Sr content) (Fig. 1C and fig. S2A). 
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Higher Sr content (@ = 0.04, 0.06) leads to high 
lattice symmetry and very weak ferroelec- 
tricity, whereas the KNSN2 and KNSN3 can 
achieve robust ferroelectricity. Adding Sr is 
beneficial to obtaining high-density KNSN 
ceramics (figs. S2, B and D) and is conductive 
to an enhanced breakdown strength. KNSN3 
samples have the highest small-signal piezo- 
electric charge coefficient d33 (fig. S2B) be- 
cause of strong ferroelectricity with multiphase 
coexistence. KNSN samples (@ = 0.02, 0.03, 
0.04) exhibit asymmetric bipolar S-E curves 
that are similar to the B — V¢ defect dipoles- 
modified KNN ceramics (19), whereas the 
strain of KNSN6 samples show a conventional 
symmetric S-E curve (Fig. 1D). Additionally, 
KNSNB ceramics exhibit a giant bipolar strain 
of 1.05% at 50 kV/cm (fig. S3), correspond- 
ing to a piezoelectric strain coefficient of 
2100 pm/V, which is a high value for a lead- 
free piezoceramic. The detailed electrostrain 
performances (including test conditions, strain 
values, and corresponding piezoelectric strain 
coefficients Sinax/Emax) are Summarized in table 
S3. Next, we will demonstrate that the giant 
asymmetric strain observed in KNSN3 is caused 
by the synergistic effect of the aligned stable 
Ve ina ~ Yo defect dipoles, the multiphase co- 
existence and strong ferroelectricity. 

Point defects of K*/Na* vacancies (V;, /na) 
and oxygen vacancies (V;) are generally be- 
lieved to be unavoidable in KNN-based cera- 
mics. We used electron paramagnetic resonance 
spectroscopy (fig. S2C) to confirm the exis- 
tence of Vg in KNSN ceramics and we found 
that Sr can reduce the concentration of Vg as 
a donor dopant (Sr, ,.,,). Chemical compo- 
sition test results (table S2) also imply that 
the concentration of Vg decreases with in- 
creasing Sr content (@ = 0.01 to 0.06; the 
content of V; in pure KNN is slightly lower 
than that of KNSN1 because of its lower sin- 
tering temperature). We found that in KNSN 
ceramics with low Sr content (wv = 0, 0.01), the 
abundant V, and low bulk density (table S2 
and fig. S2B) cause substantial leakage current 
and low breakdown strength, making prop- 
erty measurement impossible. 

As there are no By, defects introduced in 
KNSN, the asymmetric S-E curves are consi- 
dered to be induced by Vi-iyq — Vo OF Vie iwa— 
Stix xq defect dipoles, in which K*/Na* vacan- 
cies Vz Na) €xist because of the volatilization 
of alkali metal oxides during high- temperature 
sintering (24). Analogous to By, — VG defect 
dipoles (17, 19, 20), Viewwa — V6 or Veeiwa 
Sry /Na is also expected to affect the strain 
behavior. The energy assumption of Vic /Na~ 
STx vq Yeorientation (2.34 eV) is much higher 
than that of V;. [Na V5 (0.90 eV) (fig. S4). 
Meanwhile, the V, /Na ~ Sry /Na defect di- 
poles are along <001> directions, deviating 
from the easy polarization rotation path in 
the orthorhombic phase, which possesses 
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<110> spontaneous polarization directions 
(18). Therefore, we assume that Vic /Na ~ Vo 
rather than Vz /Na ~ Sry /Na defect dipoles 
play a major role in KNSN ceramics. 

The differences in S-E behavior between 
KNSNB and PZT are clear when plotted (fig. 
S5). The electric field-induced strain in ferro- 
electric materials is closely related to polariza- 
tion with quadratic dependence (72), and the 
non-180° domain switching causes typical 
butterfly-shaped S-E curves (fig. $5, A and D). 
However, KNSN3 samples show an S-E curve 
that closely resembles their P-E hysteresis loop 
under 20 kV/cm (fig. $5, E and F). We found 
three distinctive features compared with those 
of PZT ceramics: first, KNSN3 samples have 
two strain values at zero field that correspond 
to two different strain states (a and d) (fig. 
S5F). Second, the two states can be switched 
by external electric field when synchronized 
with the polarization. As we observed (fig. S5, 
E to G), the strain state switches between a 
and d states if polarization switch occurs, and 
in this case, the samples exhibit higher strain 
with larger hysteresis. Third, the strain re- 
sponse of KNSN3 sample to unipolar electric 
fields with opposite directions are also contrary, 
in which negative strain is achieved under posi- 
tive applied electric field, and vice versa. 

To understand the defect dipole contribu- 
tion to the giant electrostrain, we investigated 
the polarization and strain behaviors of KNSN3 
samples in detail, with the first measurements 
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presented all from fresh samples (Fig. 2). After 
the first application of unipolar electric field, 
the KNSN3 samples show a remnant polariza- 
tion (P,) of 25 wC/cem? (Fig. 2A). However, the 
P, ends at -3 uC/cm? instead of at -25 wC/em? 
under the reversed electric field (Fig. 2A, sec- 
ond run). In the third run, the P, returns to 
25 uC/cm? under positive electric field. The 
three test results indicate that the overall P, 
during the first run contains a fixed part of 
~11 uC/cm?, which is comparable with the 
reversible part of ~14 uC/cm”, the latter orig- 
inating from the ferroelectric domain switch- 
ing. This phenomenon is not observed in 
normal ferroelectric materials (take PZT-5H as 
an example) (fig. S6). The piezoresponse force 
microscopy (PFM) characterization further de- 
monstrates that the polarization can indeed 
be nearly reversed by the electric field (fig. $7). 
Thus, the fixed part should result from the 
aligned Vie /Na ~ Vo defect dipoles along the 
external field during the first run, which is 
subsequently stabilized. In the ferroelectric 
lattice, defect dipoles exhibit high polarizabil- 
ity, several orders of magnitude higher than 
ferroelectric polarization (19), so they can po- 
tentially produce such a high amount of fixed 
polarization when they are aligned. According 
to the direction of the electric field, the do- 
main switching occurred only in the first, 
third, and fifth runs, whereas there was almost 
no domain switching involved in the second 
and fourth runs (Fig. 2B). In the first and 
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Fig. 1. Composition-dependent crystal structure and electric properties of KNSN ceramics. (A) X-ray 
diffraction (XRD) patterns. (B) Temperature dependence of the relative permittivity over temperature range of 
25° to 400°C measured at 10 kHz. (€) Polarization—electric field (P-E) hysteresis loops. (D) Strain-electric field 
(S-E) curves of KNSN ceramics at 40 kV/cm. The inset is the enlarged parts of KNSN4 and KNSN6. 
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second runs, the strain behavior was con- 
sistent with that of a normal ferroelectric, and 
possesses the following two features: the strain 
starts to increase only after the electric field 
exceeds coercive field (about -9.9 kV/cm). The 
first S-E curve has large hysteresis, whereas the 
subsequent second one shows much smaller 
hysteresis. However, typical KNN-based ce- 
ramics with domain switching alone cannot 
contribute to such a giant strain value of 0.58% 
at -20 kV/cm. Although the domain switching 
indeed occurs, the strain does not present the 
well-known butterfly shape. In the third run, 
the strain should increase for domain switch- 
ing alone but continues to decrease after coer- 
cive field, and in the fifth run, the strain should 
decrease for domain switching alone but con- 
tinues to increase before coercive field. 

We conclude that the interaction between 
aligned defect dipoles (Pg) and domain switch- 
ing should be responsible for the strain be- 
havior and giant electrostrain we observed 
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(Fig. 2, A and B). We propose an explanation 
that includes simply superimposing the lattice 
stretching or contraction from Pg with the 
butterfly curve from domain switching (fig. 
88). At the @ point, both defect dipoles and 
polarization are aligned after the initial elec- 
tric field stimulus. By applying positive electric 
field in an a-b-c route, the Pg causes continu- 
ous lattice contraction in both a-b and b-c 
routes, whereas the domain switching leads 
to strain decrease in the a-b route but strain 
increase in the b-c route. Thus, the overall 
strain exhibits an inflection point (point b) 
at the coercive field of the a-b-c route—a 
rapidly descending a-b segment and a slowly 
descending b-c segment. With the removal of 
the external field in the c-d route, the defect 
dipole-induced lattice contraction gradually 
weakens and leads to an increase in strain, 
whereas the aligned domains partially recover 
and lead to a decrease in strain. Thus, the 
overall strain stays nearly unchanged over 
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Fig. 2. The effects of defect dipoles on polarization and strain behaviors in KNSN3. (A and B) P-E behavior 
and S-E behavior of the initial few unipolar electric field applications on a fresh sample with corresponding 
wave profiles, respectively. The unipolar measurements were carried out in 2 seconds to clearly show the P-E 
behavior, so the waveform can be regarded as part of an alternating triangle wave with a frequency of 0.25 Hz. 
(C and D) P-E hysteresis loops and S-E curves during a 3-month aging process, respectively. 
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the c-d route. When the strain curve goes from 
point a to d, the 180° polarization switch oc- 
curs while the Pg remains fixed. In this case, 
with applying the negative electric field, the 
d-e segment shows a slow increase in strain 
because the Pg leads to lattice stretch but 
domain switching leads to strain decrease, 
while the e-f segment shows a rapid increase 
in strain because the P, continuously gives rise 
to lattice stretch and domain switching also 
leads to strain increase. When the electric field 
is removed over the f-a route, the lattice 
stretching by Pa gradually weakens and leads 
to a strain decrease, whereas the aligned do- 
mains partially recover and cause strain de- 
crease too. Thus, the overall strain shows an 
obvious decrease during the #-@ route, which 
is different from that of the nearly constant 
c-d route. The difference between f-a and c-d 
segments reveals that the P, is fixed; other- 
wise, the strain curve slopes of the segments 
should be similar. The substantial strain dif- 
ference between states a and d (the two zero- 
field states in S-E curve) originates from the 
interaction between ferroelectric domains and 
P,. That is, defect dipoles generate additional 
lattice stretching or contraction under exter- 
nal field beyond ferroelectricity. 

To further examine the interaction model 
between Pg and domain switching, we de- 
signed a poling-aging experiment (fig. S9). 
The fresh samples were first poled to align 
the defect dipoles, then poled by the rever- 
sal electric field at 120°C, followed by 14 days 
of aging. If the time of reversal poling is short 
(fig. S9A), or if the poling field is not high 
enough (fig. S9B), the domains reverse, and 
the defect dipoles do not. Thus, after aging, 
the strain behavior is consistent with the fifth 
run in Fig. 2B (domain switching and lattice 
stretching induced by defect dipoles). The 
defect dipoles can only be forced to reverse 
by prolonged high reversal fields poling, and 
aging (fig. S9C), in which the strain behavior 
is consistent with third run in Fig. 2B after 
these harsh treatments. The results shown in 
fig. S9 further verify the interaction between 
domains and defect dipoles. 

After the Pq is formed by external electric 
field, its high stability is evidenced by the 
poling-aging results and the direct-current 
strain behavior. After 3 months of aging, the 
P-E and S-E loops have minimal change (Fig. 
2, C and D). The Pg are hard to reorient (fig. 
S9C), thus verifying the long-term stability of 
the Py. Additionally, the aging process will 
not enhance the electrostrain value, indicat- 
ing that all the defect-dipole alignments are 
achieved in the first application of electric 
field (Fig. 2D). Furthermore, the Pg cannot be 
reoriented by low-frequency stimuli even in 
the direct current condition (fig. S5G). 

To understand the effect of V;. ina ~ Vo de- 
fect dipoles on strain behavior, we further 
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Fig. 3. Microstructure characterizations of KNSN3. (A to C) The TEM dark- 
field images for stripe, nanosized, and mixed domains, respectively. (D and 

E) Atomic-scale HAADF images of a stripe domain region and a nanosized 
domain region, respectively, where horizontal and vertical directions correspond 
to two perpendicular <100> crystallographic directions. The mixing of various 
atomic displacement states suggests multiphase coexistence in nanosized 


investigated the microstructure of KNSN3 
ceramics. Representative domain structures 
for KNSN3 ceramics, which exhibit three dif- 
ferent morphological features, are shown in 
Fig. 3, A, B, and C. The first feature is stripe 
domains (Fig. 3A), of which the domain walls 
mainly lie in {110} or {001} planes. This crys- 
tallographic characteristic indicates that the 
areas with stripe domains are consistent with 
orthorhombic symmetry. The second feature 
is nanosized domains (Fig. 3B), of which the 
domain walls do not lie in specific crystal 
planes, implying a corresponding complex 
phase structure. The third feature is mixed 
domains, of which the nanosized domains are 
usually surrounded by stripe domains (Fig. 
3C) which dominate the domain morpholog- 
ical feature in most of the grains. The coexist- 
ence of the stripe and nanosized domains is 
also found in KNSN2 (fig. S10), which has 
asymmetric S-E curve (Fig. 1D). The energy 
dispersive spectrum (EDS) mappings of areas 
of KNSN3 ceramics containing dozens or sin- 
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gular grains demonstrate an inhomogeneous 
distribution of K and Na elements (fig. S11), 
which is a common phenomenon in KNN-based 
ceramics. The Na-rich regions have additional 
Y4(00e) superlattice reflections (where o and e 
denote odd and even Miller indices, respec- 
tively) in selected-area electron diffraction 
(SAED), whereas the K-rich regions only ex- 
hibit the main reflections of perovskite struc- 
ture (fig. S12). According to the previous 
structural studies on KNN ceramics, the 14(00e) 
superlattice reflections are most likely the 
result of the in-phase tilting of oxygen octahe- 
dra (25). 

With the help of spherical aberration-corrected 
scanning transmission electron microscopy 
(C,-STEM), we revealed the structures on the 
atomic scale (Fig. 3, D and E). Analysis of the 
local atomic displacement in the high-angle 
annular dark field (HAADF) image shows that 
Nb cations are homogenously displaced along 
the <110> direction in the stripe domain re- 
gion, which is consistent with the polarization 
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domain region. (F) The original atomic-scale HAADF image and the image 
marked with normalized A-site intensity by color of the same area. (G and 

H) Evolution of interplanar spacing of {100} planes during in situ electric stimuli 
of a stripe domain region and a nanosized domain region, respectively. The 
error bars represent the standard deviations of five times the measurements 
from the experimental images. 


direction of the orthorhombic phase (Fig. 3D). 
By contrast, both the direction and magnitude 
of Nb cation displacement vary substantially 
in nanosized domain regions. According to the 
projection of the pseudocubic unit cell, the 
nanosized domain region suggests a multi- 
phase state involving orthorhombic, tetrag- 
onal, and cubic-like phases (Fig. 3E), which 
is consistent with the average XRD results 
(table S1). We observed a high concentration 
of Vie; Which was evidenced by the drastic- 
ally variable intensity of A-site atomic columns 
(Fig. 3F), providing a basis for the formation of 
abundant defect dipoles. 

In situ-biased TEM results on nanosized 
domain regions and stripe domain regions in- 
dicate the microscopic evolution of their lat- 
tices under electric field. Figure 3G shows the 
evolution of measured lattice parameters along 
two orthogonal <100> directions in the pure 
stripe domain grain, where the lattice exhibits 
symmetric change under the electric fields 
along both directions, which is consistent with 
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Fig. 4. Excellent electrostrain properties of KNSN3. (A) Unipolar strains of KNSN3 at electric field 

from 20 kV/cm to 50 kV/cm. (B) Comparison of unipolar strain at 20 kV/cm among piezoceramics 

(8, 10, 30, 31). (C) Unipolar 20 kV/cm fatigue at room temperature. See table S4 for details of fatigue 

and test conditions. (D) Thermal stability comparison between KNSN3 and commercial lead-based ceramics 
(8, 32). (E) Unipolar Strain performance comparison among representative piezoelectrics with giant 
electric-field-induced strains (1, 2, 7, 9-11, 19, 23, 33-38). 


the strain behavior observed in ferroelectric 
domains. By contrast, the changes in local 
lattice parameters of the pure nanosized do- 
main grain (Fig. 3H) reveal that the lattice 
parameters decrease under positive electric 
stimuli, while they clearly increase under neg- 
ative electric stimuli. The stretching and con- 
traction of lattices in nanosized domain regions 
are in good agreement with the shape of S-E 
curves (Fig. 1C). 

According to the Landau-Ginzburg-Devonshire 
phenomenological theory, the free energy pro- 
file becomes flat when different ferroelectric 
phases coexist, representing a low barrier for 
polarization rotation among different states 
(26). Because of the multiphase coexistence 
in nanosized domain regions, rotation and 
alignment of the defect dipoles along the 
electric field during the first application of 
the external field is energetically favorable. 
According to the theory proposed by Arlt and 
Neumann (27), P, will introduce an extra elec- 
tric field on the free energy profile (fig. S13), 
causing the system to descend into a potential 
well in which the ferroelectric polarization of 
the nanosized domain regions align along the 
same direction as Py. This observation is also 
supported by the “collinear” polar nanoregions 
where the electrostatic, gradient, and elastic 
energies keep them stable after reorientating 
along each other (28). Therefore, the defect 
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dipoles in nanosized domain regions will re- 
main fixed after the alignment, which results 
in the fixed part of the observed polarization 
(Fig. 2A). In addition, compared with Cu- or 
Fe-doped KNN ceramics (19), the V;, /Na ~ V5 
defect dipoles have the same direction with 
the spontaneous polarization in the orthorhom- 
bic phase of KNSN ceramics. These character- 
istics allow the Vz ina ~ Vo defect dipoles to 
require no aging for alignment in multiphase 
coexistence with nanosized domains. When 
an external electric field is applied, the defect 
dipoles drive stretching or contraction of the 
surrounding lattice and interact with the do- 
main switching, leading to the asymmetric S-E 
curve, even under high electric field conditions. 
More recently, similar strain behavior has also 
been reported in NBT ceramics (29), in which 
ultrahigh strain is associated with switching 
between disordered and ordered polarization 
states, complex structure with multiphase co- 
existence, non-180° switching, and domain 
stabilization. The polarization in KNSN3 sam- 
ples, however, can be almost reversed by the 
electric field (fig. S7). Thus, we believe that 
the coupling of stabilized Vz, /Na ~ Vo defect 
dipoles aligning along <110> direction and 
ferroelectric domain switching is responsible 
for the observed high asymmetric strain level. 
For actuator applications, electrostrain is 
usually evaluated from the unipolar strain 


value. We achieved a unipolar strain as high 
as 0.7% at 50 kV/cm (Fig. 4A) (detailed data 
are summarized in table S3). Considering the 
common driving electric field of 20 kV/cm, 
the unipolar S-E curves and strain values for 
typical lead-based or lead-free piezoceramics 
are compared in Fig. 4B. KNSN3 ceramics 
clearly have the highest strain value (0.25%). 
Unlike the large-strain hysteresis observed in 
aged BaTiOs single crystal (reversible domain 
switching) and NBT-based ceramics (field- 
induced phase transition) (0, 18), low hys- 
teresis is observed when the external field, 
polarization, and defect dipoles are all in the 
same direction (Fig. 2B, second run), with the 
domain switching nearly not involved. There- 
fore, such a unipolar strain with low hysteresis 
(31% at 20 kV, comparable with commercially 
used PZT ceramics) is expected to benefit pi- 
ezoelectric actuation. In addition, the high sta- 
bility of the Pq will benefit fatigue resisance. 
We characterized the cycling reliability of 
strain under 20 kV/cm unipolar electric field, 
at which the unipolar strain only degrades 9% 
after 10” cycles (Fig. 4C) (the detailed data of 
fatigue tests are summarized in table S4). 
KNSN3 ceramics have excellent thermal sta- 
bility (Fig. 4D), with unipolar strain variation 
of less than 5% over a temperature range of 
25°C to 100°C, comparable with PZT-4 and 
better than PZT-5H ceramics. Therefore, KNSN3 
ceramics have immense potential to replace 
Pb-based piezoelectric ceramics, considering 
their capability for giant strain, low driving 
electric field, good fatigue resistance, and ther- 
mal stability. 

When the temperature exceeds 100°C, the 
electric field-induced strain of KNSN3 cera- 
mics further increases with the increase in tem- 
perature (fig. S14). The highest electrostrains— 
1.67% for bipolar and 1.2% for unipolar, both 
very high values for piezoceramics (table S3)— 
were achieved at 160°C. We also observed the 
good fatigue resistance of KNSN3 at 160°C 
(fig. S14C and table S4). These performances 
prove the great potential of KNSN3 ceramics 
for high-temperature actuator applications, 
such as piezoelectric injectors in diesel engines 
(2. We also considered whether polymorphic 
phase transition is the reason for the observed 
enhanced strain (fig. S14, D and E). The dif- 
fused dielectric anomaly that occurs around 
160°C suggests the orthorhombic-tetragonal 
phase transition, whereas the disappearance 
of the Raman mode at 200 cm™ also confirms 
the change in crystal symmetry (2). We expect 
even higher strain at room temperature may 
be obtained by decreasing T>:;—for example, 
by means of Li doping (/4). 

We provide a comparison of unipolar strain 
performance of the studied KNSN3 ceramics 
with representative piezoelectrics (Fig. 4E), 
in which KNSN3B is superior to the lead-free 
ceramics and even comparable with those of 
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both lead-free and lead-based single crystals. 
The underlying mechanism responsible for 
the giant electrostrain is the coupling of the 
defect dipoles with ferroelectric domains, which 


is done by tailoring the Vie Na ~ Vo defect 


dipoles and microstructure, thus providing 
a paradigm for the design of giant-strain 
piezoelectric materials. Considering the high 
strain under 20 kV/cm, good fatigue resistance, 
and thermal stability, KNSN is expected to be a 
great potential lead-free alternative for broad- 
temperature range and high-displacement 
piezoelectric actuator applications. 
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THE ONSAGER FELLOWSHIPS 


16 tenure-track positions available at NTNU 


The Norwegian University of Science and Technology 
(NTNU) is Norway's primary institution for educating 
the future’s engineers and scientists. The university 
also has strong programmes in the social sciences, 
teacher education, the arts and humanitites, medicine, 
architecture and fine art. NTNU's cross-disciplinary 
research delivers creative innovations that have far- 
reaching social and economic impact and that help 
contribute to a better world. 


The Onsager Fellowship programme at NTNU is designed 
to attract the most talented scholars with an established 
reputation for high quality research and a commitment to 
learning and teaching at the university level. 


BO NTNU | seenceenetectnotogy 


APPLY FOR A TENURE-TRACK POSITION AS 
AN ASSOCIATE PROFESSOR IN: 


* Quantum chemistry 


* Computational 
Porous Media Physics * Post-quantum 

* Socially-innovative Cryptography 
cities + Experimental 

Quantum 

Communication 


* Artificial Intelligence 
(Al) and Health 


* Brain MRS 


* Public and Political 
Economics 


* Biosystematics 


* STS and sustainability 
transitions 


* Circular Building 
Design 

+ Experimental 
Environmental 
Psychology 


* Political Behaviour 
* Solid Mechanics 

* Marine structures 

+ European Studies 


More info at: www.ntnu.edu/onsager-fellowship/ 
Closing date: 1 February 2023 


TOP RESEARCH DEMANDS BRILLIANT MINDS 
— WE'RE ALWAYS LOOKING FOR THE BEST 


GLENN FOUNDATION 


TOR MEDICAL RESEARCH 


Bild 
fond 


arrerican federation 
tor aging research 


2023 FUNDING OPPORTUNITIES 


RESEARCH GRANTS FOR JUNIOR FACULTY 
$150K, 1-2 years / LOI deadline:12.15.22 


POSTDOCTORAL FELLOWSHIPS IN 
AGING RESEARCH 
$75k, 1 year / LOI deadline: 01.25.23 


GLENN FOUNDATION DISCOVERY AWARDS 
$525K, 3 years / LOI deadline: 02.15.23 


For Research Focus, Eligibility Requirements, and More, 


visit www.afar.org/funding-opportunities 
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TENURED/TENURE-TRACK ASSOCIATE AND 
ASSISTANT PROFESSOR POSITIONS 


The Center for Drug Discovery (CDD) at Northeastern University (NU) invites 
applications for two junior (pre-tenure Assistant Professor) and two senior 
(Tenure-on-Entry Associate or Full Professor) positions with projected 
start dates in 2023. The mission of the CDD is to engage in drug discovery 
and development that will lead to new medications, particularly involving brain 
disorders, infectious diseases, and cardiovascular disorders. To accomplish 
this, the CDD is directed to develop novel methodologies to improve drug 
discovery. Junior scientists using breakthrough technologies are welcome to 
apply as are senior faculty with internationally recognized programs and strong 
funding, whose expertise will complement activities of faculty affiliated with 
the CDD toward drug discovery. 


A goal of this recruitment effort is to establish CDD as an NIH-funded Center 
of Excellence relevant to the mission of institutes that represent the Center’s 
membership. The CDD currently holds a NIDA T32 training grant for graduate 
students and postdoctoral fellows available to trainees of CDD faculty. The 
CDD also holds a NIDA R13 grant that sponsors the annual Chemistry and 
Pharmacology of Drug Abuse Conference at NU. The administrative team of the 
CDD provides pre- and post-award support to its members. The impact of the 
CDD is dependent on the outstanding quality of its membership and the strong 
interactions they develop with each other. Each senior CDD member is a founder 
of an active start-up company. 


Interested candidates should apply with a cover letter, curriculum vita that 
includes a list of publications, statements addressing Research, Teaching 
and Equity, and names & contact information for at least three professional 
references. For more detailed information on these positions and how to apply 
please consult the University posting: 


https://northeastern.wd1.myworkdayjobs.com/careers/job/Boston-MA- 


Main-Campus/Assistant-Professor--Associate-Professor--Full-Professor_ 
RIW62T-  ———— 


Successful candidates will have the opportunity to select among four 
participating Colleges and multiple departments for their academic home. Joint 
appointments in departments across different Colleges are highly encouraged 
at NU. NU is an equal opportunity employer, seeking to recruit and support a 
broadly diverse community of faculty and staff. 
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What's Your Next Career Move? 


From networking to mentoring to evaluating 
your Skills, find answers to your career questions 
on Science Careers 


To view the complete collection, visit 
ScienceCareers.org/booklets 
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TULAN NIVERSITY 
SCHOOL of SCIENCE 
& ENGINEERING 


The Department of Ecology & Evolutionary 
Biology (EEB) at Tulane University invites 
applications for a faculty position in 
Ichthyology (associate or full professor 
rank) who will also serve as Director of the 
Tulane University Biodiversity Research 
Institute (TUBRI) and Curator of the Royal 
D. Suttkus Fish Collection. 


See https://sse.tulane.edu/eebio/about/ 
positions/Professor-of-Ichthyology for 
additional details about the position, the 
department and TUBRI. 


To apply please visit https://apply.inter 
folio.com/117771. Review of applications 


will begin January 2, 2023. The search will 
remain open until the position is filled. 


Tulane University is an Affirmative Action/ 
Equal Employment Opportunity/ADA 
Employer. Women and minorities are 

encouraged to apply. 
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myIDP: A career plan customized 
for you, by you. 


Features in myIDP include: 


= Exercises to help you examine your skills, interests, and values. 


= Alist of 20 scientific career paths with a prediction of which ones best ‘ DF 


fit your skills and interests. 


Visit the website and start planning today! 
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WORKING LIFE 


By Madeline Schultz 


1134 


Working through grief 


s I stepped out of the chemistry building, a chirp from my phone alerted me to a text message 

from my mom. “Are you still at work? Call me when you get home.” My stomach sank as I saw 

three missed calls: two from my mom and one from my best friend Gabs. After a half-dozen 

rings, I got my mom’s answering machine. The pit in my stomach deepened when I called Gabs, 

who choked out, “Mads, you have to call your momma.” My hands were sweating as I gripped 

the steering wheel on my short drive home. As soon as I stepped out of my car, my mom called 
me back. “Daddy had a heart attack while he was surfing today, he’s dead, I’m so sorry.” I sunk to my 
knees in the grass and vomited repeatedly, shaking with sobs. 


That evening, I booked a one-way 
ticket to my mom. I emailed my ad- 
viser on the way to the airport. His 
response brought a wave of relief: “I 
am so sorry! Please take whatever 
time you need. I hope you are able 
to get home quickly” The next few 
weeks were a blur of celebrations of 
life, lawyers, and meals I could hardly 
stomach. My research barely crossed 
my mind, and my adviser reminded 
me with unconditional kindness, 
“Please don’t feel any pressure to rush 
back to lab.” 

But after a month away, I felt the 
best thing for me would be to try to 
force a return to normalcy—which 
meant returning to school. So, I went 
back to lab, hoping I could put my 
grief aside for at least a few hours 
while I focused on the research I had 
previously been so enthusiastic about. 

But as soon as I returned, I was reminded that my life was 
still far from normal. As graduate students and professors of- 
fered well-meaning sympathy, I felt embarrassed and isolated 
by my sadness. Seeing the instrument I had been constructing 
for the past year, I remembered all the times I had FaceTimed 
my dad, a master craftsman, asking for advice on which tools 
to use and how to remove a stripped screw without damaging 
fragile electronics nearby. Instead of getting back to work, I 
hid in the doorway between two labs to cry. 

I felt pathetic: After taking an entire month off, I had ex- 
pected to tackle my ever-growing to-do list. As the first grad- 
uate student in a new group, I felt entirely responsible for 
keeping our research going. If I didn’t step up, I worried our 
progress would stall. 

But despite this self-imposed pressure, about an hour after 
lI arrived in lab that first day I had reached my limit. Sobbing 
and trembling, I went to the yoga studio. In the heat of the 
studio and the depth of the stretch, I slowly began to calm. 


“Not being able to tell my dad 
dulled the accomplishment, 
but | know he would be proud.” 


For the next few weeks, I dragged 
myself to lab every day, but I was un- 
productive and uninterested in my 
research. I began to doubt whether 
my fascination for science would ever 
return. Only a year into my Ph.D., I 
didn’t see how I could possibly make 
it through the years to come. My de- 
spair deepened. 

Then, a few weeks after return- 
ing, a senior graduate student in an- 
other lab who was training me on a 
new instrument asked me how the 
start of my second year was going. I 
didn’t know her well, but something 
about her made me feel comfortable 
opening up. I broke into tears, and 
she shared with me her experience 
of losing a loved one during graduate 
school. It was the first time since I re- 
turned that I felt understood. 

From then on, I slowly began to feel 
a bit more normal each day. As my all-consuming grief gradu- 
ally dulled to a steady hum, my passion for my research re- 
turned. While analyzing a new sample or constructing a new 
piece of my instrument, I began to feel my old excitement 
again. Three months after returning, I realized I had gone a 
full week without crying in the space between labs. By piecing 
together these fleeting moments of productivity, I was able 
to collect enough data to submit an abstract, which turned 
into a conference presentation. Not being able to tell my dad 
dulled the accomplishment, but I know he would be proud. 

One year and a few months after his death, I am still learn- 
ing how to handle my grief, a little more each day. When I 
look at my instrument now, I feel a deep sadness that I can’t 
share it with my dad. That sadness will never go away. But I 
also feel proud that I carried on. 


Madeline Schultz is a Ph.D. student at the University of Rhode Island, 
Kingston. Send your career story to SciCareerEditor@aaas.org. 
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Subscribe to News from Science for unlimited 
access to authoritative, up-to-the-minute news 
on research and science policy. 


bit.ly/NewsFromScience 
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TRILLIONS 
OF MICROBES 


ONE ESSAY 


The NOSTER Science Microbiome Prize 
is an international prize that rewards 
innovative research by _ investigators 
who have completed their terminal de- 
gree in the last 10 years and are work- 
ing on the functional attributes of | 
microbiota. The research can include 
any organism with the potential to con- 
tribute to our understanding of human 
or veterinary health and disease, or to . 


guide therapeutic interventions. The 
winner and finalists will be chosen by Jennifer Hill, Ph.D. \. 


a committee of independent “scien- evemNinner 
tists” chaired by a senior editor of Science. The top prize includes 
a complimentary membership to AAAS, an online subscription 
to Science, and USD 25,000. Submit your research essay today. 
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Apply by 24 January 2023 at www.science.org/noster 


Sponsored by Noster Inc 


